(12) INTERNATIONAL .WLlCATION PUBLISHED UNDER THE PAT£ll|POOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
25 April 2002 (25.04.2002) 




PCT 



illlllliililillilillli 

(10) International Publication Number 

wo 02/33099 A2 



(51) International Patent Classification^: C12N 15/54, 
9/12, AOIK 67/027, C07K 16/40, C12Q 1/68, 1/48, GOIN 
33/50, A61K 38/45 

(21) loternational Application Number: PCT/USO 1/47728 

(22) International Filing Date: 20 October 2001 (20.10.2001) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

60/242,410 
60/244,068 
60/245,708 
60/247,672 
60/249,565 
60/252,730 
60/250,807 



20 October 
27 October 
3 November 
9 November 
16 November 
22 November 
1 December 



2000 (20.10.2000 
2000(27.10.200 
2000(03.11.200 
2000 (09.11.2000 
2000(16.11.2000 
2000(22.11.2000 
2000(01.12.2000 



US 
US 
US 

US 
US 

us 

US 



(71) Applicant (for all designated States except US): INC YTE 
GENOMICS, INC. [US/US]; 3160 Porter Drive, Palo 
Alto, CA 94304 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): GURURAJAN, 
Rajagopal [US/US]; 5591 Dent Avenue, San Jose, CA 
95118 (US). BAUGHN, Mariah, R. [US/US]; 14244 
Santiago Road, San Leandro, CA 94577 (US). WALU, 
Narinder, IC [US/US]; 890 David Street, #205, San Lean- 
dro, CA 94577 (US). ELLIOTT, Vicki, S. [US/US]; 3770 
Polton Place Way, San Jose, CA 95121 (US). XU, Yuming 
[US/US]; 1739 Walnut Drive, Mountain View, CA 94040 
(US). ARVIZU, Chandra [US/US]; 490 Sherwood Way 
#1, Menlo Park, CA 94025 (US). YAO, Monique, G. 
[US/US]; 1189 Woodgate Drive, Carmel, IN 46033 (US). 
RAMKUMAR, Jaya [IN/US]; 34359 Maybird Circle, 
Fremont, CA 94555 (US). DING, Li [CN/US]; 3353 Alma 
Street #146, Palo Alto, CA 94306 (US). TANG, Y., Tom 
[US/US]; 4230 Ranwick Court, San Jose, CA 95 1 18 (US). 
HAFALIA, April, J., A. [US/US]; 2227 Calle de Priraav- 
era, Santa Clara, CA 95054 (US). NGUYEN, Danniel, 
B. [US/US]; 1403 Ridgewood Drive, San Jose, CA 951 18 
(US). GANDHI, Ameena, R. [US/US]; 705 5th Avenue, 
San Francisco, CA 94118 (US). LU, Yan [CN/US]; 
3885 Cormia Way, Palo Alto, CA 94303 (US). YUE, 



(54) Title: HUMAN KINASES 



< 



Henry [US/US); 826 Lois Avenue, Sunnyvale, CA 94087 
(US). BURFORD, Neil [GB/US]; 105 Wildwood Circle, 
Durham, CT 06422 (US). SANDMAN, Olga [US/US]; 
366 Anna Avenue, Moutain View, CA 94043 (US). TRI- 
BOULEY, Catherine, M. [FR/US]; 1121 Tennessee, #5, 
San Francisco, CA 94107 (US). LAL, Preetl, G. [IN/US]; 
RO. Box 5142, Santa Clara, CA 95056 (US). RECIPON, 
Shirley, A. [US/US]; 85 Fortuna Avenue, San Francisco, 
CA 941 15 (US). LU, Dyung, Aina, M. [US/US]; 233 Coy 
Drive, San Jose, CA 95123 (US). BOROWSKY, Mark, 
L. [US/US]; 122 Orchard Avenue, Redwood City, CA 
94061 (US). THORNTON, Michael [US/US]; 9 Medway 
Road, Woodside, CA 94062 (US). SWARNAKER, Anita 
[CA/US]; 8 Locksley Avenue, #5D, San Francisco, CA 
94122 (US). THANGAVELU, Kavitha [IN/US]; 1950 
Montecito Avenue #23, Mountain View, CA 94304 (US). 
KHAN, Farrah, A. [IN/US]; 3617 Central Road, #102, 
Glenview, XL 60025 (US). ISON, Craig, H. [US/US]; 
1242 Weathersfield Way, San Jose, C A 951 18 (US). 

(74) Agents: HAMLET-COX, Diana et al.; Incyte Genomics, 
Inc., 3174 Porter Drive, Palo Alto, CA 94304 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 

AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, EN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS. LT, LU, LV, MA. MD, MG, MK, MN, MW. 
MX, MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, 
SL, TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, 
ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE. CH, CY. DE, DK. ES. FI. FR, GB, GR, IE, 
rr, LU, MC, NL. PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI. CM, GA, GN, GQ, GW, ML, MR, NE, SN. TD, 
TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



o 



(57) Abstract: The invention provides human human kinases (PKIN) and polynucleotides which identify and encode PKIN. The 
invention also provides expression vectors, host cells, antibodies, agonists, and antagonists. The invention also provides methods for 
diagnosing, treating, or preventing disorders associated with aberrant expression of PKIN. 



wo 02/33099 



PCTAJSOl/47728 



HUMAN KINASES 

TECHNICAL FIELD 
This invention relates to nucleic acid and amino acid sequences of human kinases and to the 
5 use of these sequences in the diagnosis, treatment, and prevention of cancer, immune disorders, 
disorders affecting growth and development, cardiovascular diseases, and lipid disorders, and in the 
assessment of the effects of exogenous compounds on the expression of nucleic acid and amino acid 
sequences of human kinases. 

10 BACKGROUND OF THE INVENTION 

Kinases comprise the largest known enzyme superfamily and vary widely in their target 
molecules. Kinases catalyze the transfer of high energy phosphate groups from a phosphate donor to 
a phosphate acceptor. Nucleotides usually serve as flie phosphate donor in these reactions, with most 
kinases utilizing adenosine triphosphate (ATP). The phosphate acceptor can be any of a variety of 

15 molecules, including nucleosides, nucleotides, lipids, carbohydrates, and proteins. Proteins are 

phosphorylated on hydroxyamino acids. Addition of a phosphate group alters the local charge on the 
acceptor molecule, causing internal conformational changes and potentially influencing mtermolecular 
contacts. Reversible protem phosphorylation is the primary method for regulating protein activity in 
eukaryotic cells. In general, proteins are activated by phosphorylation m response to extraceUular 

20 signals such as hormones, neurotransmitters, and growth and differentiation factors. The activated 
proteins initiate the cell's intracelhilar response by way of intracellular signalmg pathways and second 
messenger molecules such as cyclic nucleotides, calcium-calmodulin, inositol, and various mitogens, 

that regulate protein phosphorylation. 

Kinases are involved in all aspects of a cell's function, from basic metabolic processes, such 

25 as glycolysis, to cell-cycle regulation, differentiation, and communication with the extracellular 

environment through signal transduction cascades. Inappropriate phosphorylation of proteins in cells 
has been linked to changes in cell cycle progression and cell differentiation. Changes in the cell cycle 
have been linked to induction of apoptosis or cancer. Changes in cell differentiation have been linked 
to diseases and disorders of the reproductive system, immune system, and skeletal muscle. 

30 There are two classes of protein kinases. One class, protein tyrosine kinases (PTKs), 

phosphorylates tyrosine residues, and the other class, protein serine/threonine kinases (STKs), 
phosphoiylates serine and threonine residues. Some PTKs and STKs possess structural 
characteristics of both families and have dual specificity for both tyrosine and serine/threonine 
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residues. Almost all kinases contain a conserved 250-300 amino acid catalytic domain containing 
specific residues and sequence motifs characteristic of the kinase family. The protein kinase catalytic 
domain can be further divided into 1 1 subdomains. N-terminal subdomains I-IV fold into a two-lobed 
structure which binds and orients the ATP donor molecule, and subdomain V spans the two lobes. C- 

S terminal subdomains VI-XI bind the protein substrate and transfer the gamma phosphate from ATP to 
the hydroxyl group of a tyrosine, serine, or threonine residue. Each of the 1 1 subdomains contains 
specific catalytic residues or amino acid motifs characteristic of that subdomain. For example, 
subdomain I contains an 8-amino acid glycine-rich ATP binding consensus motif, subdomain n 
contains a critical lysine residue required for maximal catalytic activity, and subdomains VI through DC 

10 comprise the highly conserved catalytic core. PTKs and STKs also contain distinct sequence motifs 

» • 

in subdomains VI and Vm which may confer hydroxyamino acid specificity. 

In addition, kinases may also be classified by additional amino acid sequences, generally 
between S and 100 residues, which either flank or occur within the kinase domain. These additional 
amino acid sequences regulate kinase activity and determine substrate specificity. (Reviewed in 

15 Hardie, G. and Hanks, S. (1995) The Protem Kinase Facts Book, Vol I p.p. 17-20 Academic Press, 
San Diego, CA.). In particular, two protein kinase signature sequences have been identified in the 
kinase domain, the first containing an active site lysine residue involved in ATP binding, and the 
second containing an aspartate residue important for catalytic activity. If a protein analyzed includes 
the two protein kinase signatures, die probability of that protein being a protein kinase is close to 100% 

20 (PROSITE: PDOCOOlOO, November 1995). 
Protein Tvrosine Kinases 

Protein tyrosine kinases (PTKs) may be classified as either transmembrane, receptor PTKs 
or nontransmembrane, nonreceptor PTK proteins. Transmembrane tyrosine kinases function as 
receptors for most growth factors. Growth factors bind to the receptor tyrosine kinase (RTK), which 

25 causes the receptor to phosphorylate itself (autophosphorylation) and specific intracellular second 
messenger proteins. Growth factors (GF) that associate with receptor PTKs include epidermal GF, 
platelet-derived GF, fibroblast GF, hepatocyte GF, insulin and insulin-like GFs, nerve GF, vascular 
endothelial GF, and macrophage colony stimulating factor. 

Nontransmembrane, nonreceptor PTKs lack transmembrane regions and, instead, form 

30 signaling complexes with the cytosolic domains of plasma membrane receptors. Receptors that 

function through non-receptor PTKs include those for cytokines and hormones (growth hormone and 
prolactin), and antigen-specific receptors on T and B lymphocytes. 

Many PTKs were first identified as oncogene products in cancer cells in which PTK 
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activadon was no longer subject to normal cellular controls. In fact, about one third of the known 
oncogenes encode PTKs. Furthermore, celhilar transformation (oncogenesis) is often accompanied 
by increased tyrosine phosphorylation acti\dty (Qiarbonneau, H and Tonks, N. K (1992) Annu. Rev. 
Cell BioL 8:463-93). Regulation of FTK activity may therefore be an important strategy in controlling 

S some types of cancer. 

Protein Serine/Threonine Kina5;&s 

Protein serine/threonine kinases (STKs) are nontransmembrane protems. A subclass of 
STKs are known as ERKs (extracellular signal regulated kinases) or MAPs (mitogen-activated 
protein kinases) and are activated after cell stimulation by a variety of hormones and growdi factors. 

10 Cell stimulation induces a signaling cascade leading to phosphorylation of MEK (MAP/ERK kinase) 
which, in turn, activates ERK via serine and threonine phosphorylation. A varied number of proteins 
represent flie downstream effectors for the active ERK and implicate it in the control of cell 
proliferation and differentiation, as well as regulation of the cytoskeleton. Activation of ERK is 
normally transient, and cells possess dual specificity phosphatases that are responsible for its down- 

15 regulation. Also, numerous studies have shown that elevated ERK activity is associated with some 
cancers. Other STKs include the second messenger dependent protein kmases such as the 
cyclic- AMP dependent protein kinases (PKA), calcium-calmodulm (CaM) dependent protein kinases, 
and the mitogen-activated protein kinases (MAP); the cyclin-dependent protein kinases; checkpoint 
and cell cycle kinases; proliferation-related kinases; 5 -AMP-activated protein kinases; and kmases 

20 involved in apoptosis. 

The second messenger dependent protem kinases primarily mediate the effects of second 
messengers such as cyclic AMP (cAMP), cyclic GMP, inositol triphosphate, phosphatidylinositol, 
3,4,5-triphosphate, cyclic ADPribose. aradiidonic acid, diacylglycerol and calcium-cahnodulin. The 
PKAs are involved in mediating hormone-induced cellular responses and are activated by cAMP 

25 produced within die cell in response to hormone stimulation. cAMP is an intracellular mediator of 
hormone action in all animal cells that have been studied. Hormone-induced cellular responses include 
thyroid honnone secretion, Cortisol secretion, progesterone secretion, glycogen breakdown, bone 
resorption, and regulation of heart rate and force of heart muscle contraction. PKA is found in all 
animal cells and is thought to account for the effects of cAMP in most of these cells. Altered PKA 

30 expression is implicated in a variety of disorders and diseases including cancer, thyroid disorders, 
diabetes, atherosclerosis, and cardiovascular disease (Isselbacher, K.J. et al. (1994) Harrison's 
Principles of Internal Medicine , McGraw-Hill, New York, NY, pp. 416-431, 1887). 

The casein kinase I (CKI) gene family is another subfamily of serine/tfareomne protein 
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kinases. This contimiously expanding group of kinases have been implicated in the regulation of 
numerous cytoplasmic and nuclear processes, including ceU metabolism, and DNA replication and 
repair. CKI enzymes are present in the membranes, nucleus, cytoplasm and cytoskeleton of 
eukaryotic cells, and on the mitotic spindles of mammalian cells (Fish, KJ. et al«, (1995) J. Biol. Qiem. 
5 270:14875-14883. 

The CKI family members all have a short amino-terminal domain of 9-76 amino acids, a highly 
conserved kinase domain of 284 amino acids, and a variable carboxyl-terminal domain that ranges 
from 24 to over 200 amino acids in length (Cegielska, A. et aL, (1998) J. BioL Chem. 273:1357-1364.) 
The CKI family is comprised of higjily related proteins, as seen by the identification of isoforms of 

10 casein kinase I from a variety of sources. There are at least five mammalian isoforms, a, p, y> 5, and 
8. Fish et al., identified CKI-epsilon from a human placenta cDNA library. It is a basic protein of 416 
amino acids and is closest to CKI-delta. Hirough recombinant expression, it was determined to 
phosphorylate known CKI substrates and was inhibited by the CKI-specific inhibitor CKI-7. The 
human gene for CKI-epsilon was able to rescue yeast with a slow-growth phenotype caused by 

15 deletion of the yeast CKI locus, HRR250 (Fish et al, supra.) 

The mammalian circadian mutation tau was found to be a semidominant autosomal aUele of 
CKI-epsilon that markedly shortens period length of circadian ifaythms in Syrian hamsters. The tau 
locus is encoded by casein kinase I-epsilon, which is also a homolog of the Drosoplula circadian gene 
double-time. Studies of both the wildtype and tau mutant CKI-epsilon enzyme indicated that the 

20 mutant enzyme has a noticeable reduction in the maximum velocity and autophosphorylation state. 
Farther, in vitro, CKI-epsilon is able to interact with mammalian PERIOD proteins, while the mutant 
enzyme is deficient in its ability to phosphorylate PERIOD. Lowrey et al., have proposed that CKI- 
epsilon plays a major role in delaying the negative feedback signal within the transcription-translation- 
based autoregulatory loop that composes the core of the circadian mechanism. Therefore the CKI- 

25 epsilon enzyme is an ideal target for pharmaceutical compounds influencing circadian rhydims, jet-lag 
and sleep, in addition to other physiologic and metabolic processes under circadian regulation (Lowtey, 
P.L. et al., (2000) Science 288:483-491.) 
Calcium-CalmoduUn Dependent Protein Kinases 

Calcium-calmodulin dependent (CaM) kinases are involved in regulation of smooth muscle 

30 contraction, glycogen breakdown (phosphorylase kinase), and neurotransmission (CaM kinase I and 
CaM kinase II). CaM dependent protein kinases are activated by calmodulin, an intracellular calcium 
receptor, in response to the concentration of fiee calcium in the cell. Many CaM kinases are also 
activated by phosphorylation. Some CaM kinases are also activated by autophosphorylation or by 
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oflier regulatory kinases, CaM kinase I phosphorylates a variety of substrates including the 
neurotransmitter-related proteins synapsin I and E, the gene transcription regulator, CREB, and the 
cystic fibrosis conductance regulator protein, CFTR (Haribabu, B. et al. (1995) EMBO Journal 
14:3679-3686). CaM kinase n also phosphorylates synapsin at different sites and controls the 
5 synthesis of catecholamines in the brain through phosphorylation and activation of tyrosine 
hydroxylase. CaM kinase n controls the synfliesis of catecholamines and seratonin, through 
phosphorylation/activation of tyrosine hydroxylase and tryptophan hydroxylase, respectively (Fujisawa, 
¥L (1990) BioEssays 12:27-29). The mRNA encoding a cahnodulin-binding protein kinase-like protein 
was found to be enriched in mammalian forebrain. This protem is associated with vesicles in both 
10 axons and dendrites and accumulates largely postnatally. The amino acid sequence of this protein is 
similar to CaM-dependent STKs, and the protein binds calmodulin in the presence of calcium 
(Godbout, M. et aL (1994) J. Neurosci. 14:1-13). 
Mitogen- Activated Protein Kinases 

The mitogen-activated protein kinases (MAP) which mediate signal transduction from the cell 
15 surface to the nucleus via phosphorylation cascades are another STK family fliat regulates 

intracelhilar signaUng pafliways. Several subgroups have been identified, and each manifests different 
substrate specificities and responds to distinct extracelhilar stimuli (Egan, S.E and Weinberg, R.A. 
(1993) Nature 365:781-783). MAP kinase signaling pathways are present in mammalian cells as well 
as in yeast. The extracelhilar stimuli which activate MAP kinase pathways include epidermal growth 
20 factor (EGF), ultraviolet light, hyperosmolar medium, heat shock, endotoxic lipopolysaccharide (LPS), 
and pro-inflanunatory cytokines such as tumor necrosis factor (TNF) and interleukin-1 (IL-1). 
Altered MAP kinase egression is implicated in a variety of disease conditions including cancer, 
inflammation, immune disorders, and disorders affecting growth and development 
Cvclin-Denendent Protein Kinases 
25 The cyclm-dependent protein kinases (CDKs) are STKs fliat control the progression of cells 

through the cell cycle. The entry and exit of a cell from mitosis are regulated by the synfliesis and 
destruction of a family of activating protems called cyclins, Cyclins are small regulatory proteins that 
bind to and activate CDKs, which then phosphorylate and activate selected proteins mvolved in the 
mitotic process. CDKs are unique in that they require multiple inputs to become activated. In addition 
30 to cyclin binding, CDK activation requires the phosphorylation of a specific threonine residue and tiie 
dephosphorylation of a specific tyrosine residue on the CDK. 

Another family of STKs associated with die cell cycle are the NIMA (never in mitosis)- 
related kinases (Neks). Both CDKs and Neks are involved in duplication, maturation, and separation 
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of the microtubule organizing center, the centrosome, in animal cells (Fry, A.M., et aL (1998) EMBO 
J. 17:470-481). The NIM-related kinases also include NIKl histidine kinases, which function in signal 
transmission (Yamada-Okabe. T. et al. (1999) J. Bacteriol. 181:7243-7247). 
Checkpoint and Cell Cvcle Kinases 
S In the process of cell division, the order and timing of cell cycle transitions are under control of 

ceU cycle checkpoints, which ensure that critical events such as DNA replication and chromosome 
segregation are carried out with precision. If DNA is damaged, e.g. by radiation, a checkpoint 
pathway is activated that arrests the cell cycle to provide time for repair. If the damage is extensive, 
apoptosis is induced. In the absence of such checkpoints, the damaged DNA is inherited by aberrant 

10 cells which may cause proliferative disorders such as cancer. Protein kinases play an important role 
in this process. For example, a specific kinase, checkpoint kinase 1 (Chkl), has been identified in 
yeast and mammals, and is activated by DNA damage in yeast. Activation of Chkl leads to the arrest 
of the cell at the G2/M transition. (Sanchez, Y. et aL (1997) Science 277:1497-1501.) Specifically, 
Chkl phosphorylates the cell division cycle phosphatase CDC2S, inhibiting its normal function which is 

IS to dephosphorylate and activate the cyclin-dependent kinase Cdc2. Cdc2 activation controls the entry 
of cells into mitosis. (Peng, C-Y et al. (1997) Science 277:1501- 1505.) Thus, activation of Chkl 
prevents the damaged cell from entering mitosis. A similar deficiency in a checkpoint kinase, such as 
Chkl, may also contribute to cancer by failure to arrest cells with damaged DNA at other checkpoints 
such as G2/M. 

20 Proliferation-Related Kinases 

Proliferation-related kinase is a semm/cytokine inducible STK that is involved in regulation of 
the cell cycle and cell proliferation in human megakarocytic cells (li, B. et aL (1996) J. Biol. Chem. 
271:19402-8). Proliferation-related kinase is related to the polo (derived from Drosophila polo gene) 
family of STKs implicated in cell division. Proliferation-related kinase is downregulated in lung tumor 

25 tissue and may be a proto-oncogene whose deregulated expression in normal tissue leads to oncogenic 
transformation. 

The RET (rearranged during transfection) proto-oncogene encodes a tyrosine kinase receptor 
involved in both multiple endocrine neoplasia type 2, an inherited cancer syndrome, and ICrschsprung 
disease, a developmental defect of enteric neurons. RET and its functional ligand, glial ceU line- 
30 derived neurotrophic factor, play key roles in the development of the human enteric nervous system 
(Pachnis, V. et aL (1998) Am. J. Physiol. 275:G183-G186). 
5 -AMP-activated protein Icinase 

A ligand-activated STK protein kinase is 5 -AMP-activated protein kinase (AMPK) (Gao, G. 

6 
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et al. (1996) J. Biol Chem. 271:8675-8681). Mammalian AMPK is a regulator of fatty acid and sterol 
synthesis through phosphorylation of the enzymes acetyl-Co A carboxylase and 
hydroxymethylghitaryl-CoA reductase and mediates responses of these pathways to celhilar stresses 
such as heat shock and depletion of ghicose and ATP. AMPK is a heterotrimeric complex comprised 
5 of a catalytic alpha subunit and two non-catalytic heta and gamma subunits that are believed to 
regulate the activity of the alpha subunit. Subunits of AMPK have a much wider distribution in 
non-lipogenic tissues such as brain, heart, spleen, and hing than expected. This distribution suggests 
that its role may extend beyond regulation of lipid metabolism alone. 
Kinases in Anoptosis 

10 Apoptosis is a highly regulated signaling pathway leading to cell death that plays a crucial role 

in tissue development and homeostasis. Deregulation of this process is associated with the 
pathogenesis of a number of diseases including autoimmune disease, neurodegenerative disorders, and 
cancer. Various STKs play key roles in this process. ZIP kinase is an STK containmg a C-terminal 
leucine zipper domain in addition to its N-terminal protein kinase domam. This C-terminal domain 

15 appears to mediate homodimerization and activation of the kinase as well as interactions with 
transcription factors such as activating transcription factor, ATF4, a member of the cyclic- AMP 
responsive element binding protein (ATF/CREB) family of transcriptional factors (Sanjo, H. et aL 
(1998) J. Biol. Chem, 273:29066-29071). DRAKl and DRAK2 are STKs that share homology with 
the death-associated protein kinases (DAP kinases), known to function in interferon-y mduced 

20 apoptosis (Sanjo et al. supraV Like ZIP kinase, DAP kinases contain a C-terminal protein-protein 
interaction domain, in the form of ankyrin repeats, in addition to the N-terminal kinase domam. ZIP, 
DAP, and DRAK kinases induce morphological changes associated with apoptosis when transfected 
into NIH3T3 cells (Sanjo et al. supraV However, deletion of either the N-termmal kinase catalytic 
domain or the C-terminal domain of these proteins abolishes apoptosis activity, mdicating that in 

25 addition to the kinase activity, activity in the C-terminal domain is also necessary for apoptosis, 
possibly as an interacting domain with a regulator or a specific substrate. 

RICK is another STK recently identified as mediating a specific apoptotic pathway involving 
the death receptor, CD95 (Inohara, N. et al. (1998) J. Biol. Chem. 273:12296-12300). CD95 is a 
member of die tumor necrosis factor receptor superfamily and plays a critical role in the regulation 

30 and homeostasis of die immune system (Nagata, S. (1997) Cell 88:355-365). The CD95 receptor 
signaling pathway involves recruitment of various mtracellular molecules to a receptor complex 
following ligand binding. This process mchades recruitment of the cysteine protease caspase-8 which, 
in turn, activates a caspase cascade leading to cell death. RICK is composed of an N-terminal kinase 
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catalytic domain and a C-tenninal "caspase-recniitment" domain that interacts with caspase*like 
domains, indicating that RICK plays a role in the recruitment of caspase-8. This interpretation is 
supported by the fact that the expression of RICK in human 293T cells promotes activation of 
caspase-8 and potentiates the induction of apoptosis by various proteins involved in the CD95 
5 apoptosis pathway (Inohara et al. supra}. 

Mitochondrial Prntftin Kinajsas 

A novel class of eukaryotic kinases, related by sequence to prokaryotic histidine protein 
kinases, are the mitochondrial protein kinases (MPKs) which seem to have no sequence similarity with 
other eukaryotic protein kinases. These protein kinases are located exclusively in the mitochondrial 

10 matrix space and may have evolved from genes originally present in respiration-dependent bacteria 
which were endocytosed by primitive eukaryotic cells. MPKs are responsible for phosphorylation and 
inactivation of the branched-chain alpha-ketoacid dehydrogenase and pyruvate dehydrogenase 
complexes (Harris, R.A. et al. (1995) Adv. Enzyme Regul. 34:147-162). Five MPKs have been 
identified. Four members correspond to pyruvate dehydrogenase kinase isozymes, regulating the 

IS activity of the pyruvate dehydrogenase complex, which is an important regulatory enzyme at die 
interface between glycolysis and the citric acid cycle. The fifth member corresponds to a branched- 
chain alpha-ketoacid dehydrogenase kinase, important in the regulation of the pathway for the disposal 
of branched-chain amino acids. (Harris, R.A. et al. (1997) Adv. Enzyme Regul. 37:271-293). Both 
starvation and the diabetic state are known to result in a great increase in the activity of the pyruvate 

20 dehydrogenase kinase in the liver, heart and muscle of the rat This increase contributes in both 
disease states to the phosphorylation and inactivation of the pyruvate dehydrogenase complex and 
conservation of pyruvate and lactate for ghiconeogenesis (Harris (199S) supra) . 

IQNASES WITH NON-PROTEIN SUBSTRATES 

25 

Lipid and Inositol kinases 

Lipid kinases phosphorylate hydroxyl residues on lipid head groups. A family of kinases 
involved in phosphorylation of phosphatidylinositol (PI) has been described, each member 
phosphorylating a specific carbon on the inositol ring (Leevers, SJ. et al. (1999) Curr. Opin. Cell. Biol. 
30 1 1 :2 1 9-225). The phosphorylation of phosphatidylinositol is involved in activation of the protein kinase 
C signaling pathway. The inositol phospholipids (phosphoinositides) intracellular signaling pathway 
begins with binding of a signaling molecule to a G-protein linked receptor in the plasma membrane. 
This leads to the phosphorylation of phosphatidylinositol (PI) residues on the inner side of the plasma 

S 
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membrane by inositol kinases^ thus converting PI residues to the biphosphate state (PIP2). PIPj is 
then cleaved into inositol triphosphate (IP3) and diacylglycerol. These two products act as mediators 
for separate signaling pafliways. Cellular responses that are mediated by these pathways are 
glycogen breakdown in the liver in response to vasopressin, smooth muscle contraction in response to 

S acetylcholine, and thrombin-induced platelet aggregation. 

PI 3-kinase (PI3K), which phosphorylates the D3 position of PI and its derivatives, has a 
central role in growth factor signal cascades involved in cell growth, differentiation, and metabolism. 
PI3K is a heterodimer consisting of an adapter subunit and a catalytic subunit. The adapter subunit 
acts as a scaffolding protein, interacting with specific tyrosine-phosphorylated proteins, lipid moieties, 

10 and other cytosolic factors. When the adapter subunit binds tyrosine phosphorylated targets, such as 
the insulin responsive substrate (IRS)-l, the catalytic subunit is activated and converts PI (4,5) 
bisphosphate (PIP2) to PI (3,4,5) P3 (PIP3). PIP3 then activates a number of other proteins, inchidmg 
PKA, protein kinase B (PKB), protein kinase C (PKC), glycogen synthase kinase (GSK)-3, and p70 
ribosomal s6 kinase. PBK also interacts directly with the cytoskeletal organizing proteins, Rac, rho, 

15 and cdc42 (Shepherd, P.R., et al. (1998) Biochem. J. 333:471-490). Animal models for diabetes, such 
as obese and fat mice, have altered PDK adapter subunit levels. Specific mutations in the adapter 
subunit have also been found in an insulin-resistant Danish population, suggesting a role for PDK in 
type-2 diabetes (Shepard, supra) . 

PKC is also activated by diacylglycerol (DAG). Phorbol esters (PE) are analogs of DAG and 

20 tumor promoters that cause a variety of physiological changes when administered to cells and tissues. 
PE and DAG bind to the N-terminal region of PKC. This region contains one or more copies of a 
cysteine-rich domain about 50 amino-acid residues long and essential for DAG/PE-binding. 
Diacylglycerol kinase (DGK), the enzyme that converts DAG into phosphatidate, contains two copies 
of the DAG/PE-binding domain in its N-terminal section (Azzi, A. et al. (1992) Eur. J. BiochenL 

25 208:547-557). 

An example of lipid kinase phosphorylation activity is the phosphorylation of 
D-erythro-sphingosme to the sphingolipid metabolite, sphingosine-l-phosphate (SPP). SPP has 
emerged as a novel lipid second-messenger with both extracellular and intracellular actions (Kohama, 
T. et al. (1998) J. Biol. Chem. 273:23722-23728). ExtraceUularly, SPP is a ligand for flie G-protein 

30 coupled receptor EDG-1 (endothelial-derived, G-protein coupled receptor). Intracellularly, SPP 
regulates ceD growth, survival, motility, and cytoskeletal changes. SPP levels are regulated by 
sphingosine kinases that specifically phosphorylate D-eiythro-sphingosine to SPP. The importance of 
sphingosine kinase in cell signaling is indicated by the fact that various stimuli, including 
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platelet-derived growth factor (PDGF), nerve growth factor, and activation of protein kinase C, 
increase cellular levels of SPP by activation of sphingosine kinase, and the fact that competitive 
inhibitors of the enzyme selectively inhibit cell proliferation induced by PDGF (Kohama et al. supra). 
Purine Nucleotide Kinases 

5 The purine nucleotide kinases, adenylate kinase (ATP: AMP phosphotransferase, or AdK) and 

guanylate kinase ( ATP:GMP phosphotransferase, or GuK) play a key role in nucleotide metabolism 
and are crucial to the synthesis and regulation of cellular levels of ATP and GTP, respectively. These 
two molecules are precursors in DNA and RNA synthesis in growing cells and provide the primary 
source of biochemical energy in cells (ATP), and signal transduction pathways (GTP). Inhibition of 

10 various steps in the synthesis of these two molecules has been the basis of many antiproliferative 
drugs for cancer and antiviral therapy (Pillwein, K. et al. (1990) Cancer Res. 50:1576-1579). 

AdK is found in almost all cell types and is especially abundant in cells having high rates of 
ATP synthesis and utilization such as skeletal muscle. In these cells AdK is physically associated with 
mitochondria and myofibrils, the subcellular structures that are involved in energy production and 

15 utilization, respectively. Recent studies have demonstrated a major function for AdK in transferring 
high energy phosphoryk from metabolic processes generating ATP to cellular components consuming 
ATP ( Zeleznikar, R. J. et al. (1995) J. BioL Chem. 270:73 1 1-73 19). Thus AdK may have a pivotal 
role in maintaining energy production in ceUs, particularly those having a high rate of growth or 
metabolism such as cancer cells, and may provide a target for suppression of its activity to treat 

20 certain cancers. Alternatively, reduced AdK activity may be a source of various metabolic, 
muscle-energy disorders that can result in cardiac or respiratory failure and may be treatable by 
increasing AdK activity. 

GuK, in addition to providing a key step in the synthesis of GTP for RNA and DNA synthesis, 
also fulfills an essential function in signal transduction pathways of cells through the regulation of GDP 

25 and GTP. Specifically, GTP binding to membrane associated G proteins mediates the activation of cell 
receptors, subsequent iatracellular activation of adenyl cyclase, and production of the second 
messenger, cyclic AMP. GDP binding to G proteins inhibits these processes. GDP and GTP levels 
also control the activity of certain oncogenic proteins such as p21"' known to be involved in control of 
cell proliferation and oncogenesis (Bos, J.L. (1989) Cancer Res. 49:4682-4689). High ratios of 

30 GTP:GDP caused by suppression of GuK cause activation of p21'" and promote oncogenesis. 
Increasing GuK activity to increase levels of GDP and reduce the GTP:GDP ratio may provide a 
therapeutic strategy to reverse oncogenesis. 

GuK is an important enzyme in the phosphorylation and activation of certain antiviral drugs 
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useful in the treatment of herpes virus infections. These drugs include the guanme homologs acyclovir 
and buciclovir (MiUer, W.H. and Miller R.L. (1980) J. Biol. Chem. 255:7204-7207; Stenberg. K. et al. 
(1986) J. Biol, Chem. 261:2134-2139). Increasing GuK activity in infected cells may provide a 
flierapeutic strategy for augmenting the effectiveness of these drugs and possibly for reducing the 
S necessary dosages of the drugs. 
Pvrimidine Kinases 

The pyiimidine kinases are deoxycytidine kinase and thymidine kinase 1 and 2. Deoxycytidine 
kinase is located in the nucleus, and thymidine kinase 1 and 2 are found in the cytosol (Johansson, M. 
et al. (1997) Proc. NatL Acad. Sci. U.S.A. 94:11941-11945). Phosphorylation of 

10 deoxyribonucleosides by pyrimidme kinases provides an altemative pathway for de novo synthesis of 
DNA precursors. The role of pyrimidine kinases, like purine kinases, in phosphorylation is critical to 
the activation of several chemotherapeuticafly important nucleoside analogues (Araer E.S. and 
Eriksson, S. (1995) Pharmacol. Ther. 67:155-186). 

The discovery of new human kinases, and the polynucleotides encoding them, satisfies a need 

15 in die art by providing new compositions which are useful in the diagnosis, prevention, and treatment 
of cancer, immune disorders, disorders affecting growth and development, cardiovascular diseases, 
and lipid disorders, and in the assessment of the effects of exogenous compounds on die expression of 
nucleic acid and amino acid sequences of human kinases. 

20 SUMMARY OF THE INVENTION 

The invention features purified polypeptides, human kinases, referred to collectively as 
'TJaN-* and individually as *TKIN-1," "PKIN-2," "PKIN-3," "PKIN-4." "PKIN-5," 'TKIN.6," 
•TKIN-7," "PKIN-9," *TKIN-10,^* *TKIN-11," "PKIN-12," "PKIN-13," *TKIN-14," 

•TKIN-15," *TKIN-16," "PKIN-17," *TKIN-18," •TKIN-19," "PKIN-20." *TKIN-21," and *TKIN- 

25 22." In one aspect, the invention provides an isolated polypeptide selected fi-om the group consisting 
of a) a polypeptide comprising an amino acid sequence selected from flie group consisting of SEQ ID 
NO: 1-22, b) a polypeptide comprismg a naturally occurring amino acid sequence at least 90% identical 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, c) a biologically 
active fragment of a polypeptide having an amino acid sequence selected fi*om the group consisting of 

30 SEQ ID NO: 1-22, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-22. In one altemative, the invention provides an 
isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 1-22. 

The invention further provides an isolated polynucleotide encoding a polypeptide selected from 
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the group consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-22, b) a polypeptide comprising a naturally occurring amino acid sequence 
at least 90% identical to an amino acid sequence selected from the group consisting of SEQ ID N0:1- 
22, c) a biologically acdve fragment of a polypeptide having an amino acid sequence selected from the 
S group consisting of SEQ ID NO: 1-22, and d) an immunogenic fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-22. In one alternative, the 
polynucleotide encodes a polypeptide selected from the group consisting of SEQ ID NO: 1-22. In 
another altemative, the polynucleotide is selected from the group consisting of SEQ ID NO:23-44. 
Additionally, the invention provides a recombinant polynucleotide comprising a promoter 

10 sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 

consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-22, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical to an amino acid sequence selected fix>m the group consisting of SEQ ID NO: 1-22, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected fix>m tiie group 

15 consisting of SEQ ID NO:l-22, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-22. In one altemative, the 
invention provides a cell transformed with the recombinant polynucleotide. In another altemative, the 
invention provides a transgenic organism comprising the recombinant polynucleotide. 

The invention abo provides a method for producing a polypeptide selected from the group 

20 consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-22, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-22, and d) an immunogenic fragment of a polypeptide having an amino 

25 acid sequence selected from the group consisting of SEQ ID NO: 1-22. The method comprises a) 
culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is 
transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide encoding the polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 

30 polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-22, b) a polypeptide comprising a naturally 
occurring amino acid sequence at least 90% identical to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-22, c) a biologically active fragment of a polypeptide having an 
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amino acid sequence selected flrom the group consisting of SEQ ID NO:l-22, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO:l-22. 

The invention further provides an isolated polynucleotide selected from the group consisting of 

5 a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:23-44, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, 
c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to 
the polynucleotide of b). and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide 

10 comprises at least 60 contiguous nucleotides. 

Additionally, die invention provides a mediod for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide selected from the group consisting of 
a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:23-44, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 

15 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, 
c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to 
the polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the 
sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence 
complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to 

20 said target polynucleotide, under conditions whereby a hybridization complex is formed between said 
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of 
said hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe 
comprises at least 60 contiguous nucleotides. 

The invention further provides a method for detecting a target polynucleotide m a sample, said 

25 target polynucleotide having a sequence of a polynucleotide selected from the group consisting of a) a 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:23-44, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, c) a 
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the 

30 polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) amplifying said 
target polynucleotide or fragment thereof using polymerase chain reaction amplification, and b) 
detecting the presence or absence of said amplified target polynucleotide or fragment thereof, and, 
optionally, if present, the amount thereof. 
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The invention iiirther provides a composition comprising an effective amount of a polypeptide 
selected from the group consisting of a) a polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-22, b) a polypeptide comprising a naturally occurring 
amino acid sequence at least 90% identical to an amino acid sequence selected from the group 
5 consisting of SEQ ID NO: 1-22, c) a biologically active fragment of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-22, and d) an immunogenic fragment of 
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:l-22» 
and a pharmaceutically acceptable excipient. In one embodiment, the composition comprises an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-22. The invention additionally 

10 provides a method of treating a disease or condition associated with decreased expression of 

functional PKIN, comprising administering to a patient in need of such treatment the composition. 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-22, b) a polypeptide comprising a 

15 naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-22, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO:l-22, and d) an 
immunogenic fragment of a polypeptide having an. amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-22. Hie method comprises a) exposing a sample comprising the 

20 polypeptide to a compound, and b) detecting agonist activity in the sample. In one alternative, the 
invention provides a composition comprising an agonist compound identified by the method and a 
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of 
treating a disease or condition associated with decreased expression of functional PKIN, comprising 
administering to a patient in need of such treatment the composition. 

25 Additionally, the invention provides a method for screening a compound for effectiveness as 

an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-22, b) a polypeptide 
comprising a naturally occurring amino acid sequence at least 90% identical to an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-22, c) a biologically active fragment of 

30 a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, 
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-22. The method comprises a) exposing a sample comprising the 
polypeptide to a compound, and b) detecting antagonist activity in the sample. In one altemative, the 
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invention provides a composition comprising an antagonist compound identified by the method and a 
pharmaceuticaUy acceptable excipient In another alternative, the invention provides a method of 
treating a disease or condition associated with overexpression of functional PKIN, comprising 
administering to a patient in need of such treatment the composition. 

5 The mvention further provides a method of screening for a compound that specifically binds to 

a polypeptide selected from die group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-22, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-22, c) a biologically active fragment of a polypeptide 

10 having an ammo acid sequence selected from the group consisting of SEQ ID NO: 1-22, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected fit>m the group 
consisting of SEQ ID NO: 1-22, The method comprises a) combining the polypeptide with at least one 
test compound under suitable conditions, and b) detecting binding of the polypeptide to the test 
compound, thereby identifying a compound that specifically binds to the polypeptide. 

15 The invention further provides a method of screening for a compound that modulates the 

activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-22, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-22, c) a biologically active fragment of a polypeptide 

20 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, and d) an 
immunogenic fragment of a polypeptide having an annuo acid sequence selected from the group 
consisting of SEQ ID NO: 1-22. The method comprises a) combining the polypeptide with at least one 
test compound under conditions permissive for die activity of the polypeptide, b) assessing the activity 
of the polypeptide in the presence of the test compound, and c) comparing the activity of die 

25 polypeptide in the presence of the test compound with the activity of the polypeptide in the absence of 
the test compound, wherein a change in the activity of the polypeptide in the presence of the test 
compound is indicative of a compound diat modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 

30 polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, the method 

comprising a) exposing a sample comprising the target polynucleotide to a compound, and b) detecting 
altered expression of the target polynucleotide. 

The invention further provides a mediod for assessing toxicity of a test compound, said 
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method comprising a) treating a biological sample containing nucleic acids with the test compound; b) 
hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, ii) a 
S polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, iii) a polynucleotide 
having a sequence complementary to i), iv) a polynucleotide complementary to the polynucleotide of 
ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 

10 sample, said target polynucleotide selected from the group consisting of i) a polynucleotide comprising 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, ii) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, iii) a polynucleotide 
complementary to the polynucleotide of i), iv) a polynucleotide complementary to the polynucleotide of 

15 ii), and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide comprises a fragment 
of a polynucleotide sequence selected from the group consisting of i)-v) above; c) quantifying the 
amount of hybridization complex; and d) comparing the amoimt of hybridization complex in the treated 
biological sample with the amount of hybridization complex in an untreated biological sample, wherein 
a difference in the amount of hybridization complex in the treated biological sample is indicative of 

20 toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for die full length polynucleotide and polypeptide 
sequences of the present invention. 
25 Table 2 shows the GenBank identification number and annotation of the nearest GenBank 

homolog for polypeptides of the invention. The probability score for the match between each 
polypeptide and its GenBank homolog is ako shown. 

Table 3 shows structural features of polypeptide sequences of the invention, including 
predicted motifs and domains, along with the methods, algorithms, and searchable databases used for 
30 analysis of the polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
polynucleotide sequences of the invention, along widi selected fragments of the polynucleotide 
sequences. 
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Table 5 shows flie represeatative cDNA library for polynucleotides of the invention. 
Table 6 provides an appendix which describes the tissues and vectors used for construction of 
the cDNA libraries shown in Table 5. 

Table 7 shows die tools, programs, and algorithms used to analyze the polynucleotides and 
5 polypeptides of the invention, along witii applicable descriptions, references, and threshold parameters. 



DESCRIPTION OF THE INVENTION 
Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
10 may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in die appended claims, the singular forms "a," ""an," 
and 'the" include phiral reference unless the context clearly dictates otherwise. Thus, for example, a 
15 reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody^* is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 

20 Althougih any machines, materials, and methods similar or equivalent to diose described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing the 
cell lines, protocols, reagents and vectors which are reported in the publications and which might be 
used in connection with the invention. Nothing herein is to be constmed as an admission that the 

25 invention is not entitied to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"TKIN" refers to the amino acid sequences of substantially purified PKIN obtained firom any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and human, 
and fi'om any source, whether natural, synthetic, semi-synthetic, or recombinant 
30 The term '"agonist" refers to a molecule which intensifies or mimics the biological activity of 

PKIN. Agonists may include proteins, nucleic acids, carbohydrates, smaU molecules, or any other 
compound or composition which modulates the activity of PKIN either by directly interacting wifli 
PKIN or by acting on components of the biological pathway in which PKIN participates. 
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An ''allelic variant is an alternative form of the gene encoding PKIN. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Conunon mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

''Altered'' nucleic acid sequences encoding PKIN include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as PKIN or a 
polypeptide with at least one functional characteristic of PKIN. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding PKIN, and improper or unexpected hybridization to allelic variants, with a 
locus other than the normal chromosomal locus for the polynucleotide sequence encoding PKIN. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino 
acid residues which produce a silent change and result in a functionally equivalent PKIN. Deliberate 
amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological 
or immunological activity of PKIN is retained. For example, negatively charged amino acids may 
include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and 
arginine. Amino acids widi uncharged polar side chains having similar hydrophilicity values may 
include: asparagine and ghitamine; and serine and threonine. Amino acids with uncharged side chains 
having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; 
and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence 
to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known 
in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of PKIN. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
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molecules, or any other compound or composition which modulates the activity of PKIN either by 
directly interacting with PKIN or by acting on components of the biological pathway in which PKIN 
participates. 

The term "antibody"' refers to intact immunoglobulm molecules as well as to fragments 
5 thereof, such as Fab, F(ab')2» and Fv fragments, which are capable of binding an epitopic determmant 
Antibodies that bmd PKIN polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the inununizing antigen- The polypeptide or oligopeptide used 
to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, 
or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used 
10 carriers that are chemically coupled to peptides include bovine semm albumin, thyroglobulin, and 
kejiiole hmpet hemocyanin (KLH). The coupled peptide is then used to mmiunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protem may induce the production of antibodies 
15 which bind specifically to antigenic determinants (particular regions or three-dimensional stmctures on 
the protein). An antigenic determinant may compete with the intact antigen (i.e., the mununogen used 
to elicit the immune response) for binding to an antibody. 

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a 
specific molecular target. Aptamers are derived from an in vitro evolutionary process (e.g., SELEX 
20 (Systematic Evolution of ligands by Exponential Enrichment), described in U.S. Patent No. 

5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries. 
Aptamer compositions may be double-stranded or single-stranded, and may include 
deoxyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. The 
nucleotide components of an aptamer may have modified sugar groups (e.g., the 2 -OH group of a 
25 ribonucleotide may be replaced by 2 -F or 2 -NH^), which may improve a desired property, e.g., 

resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 
e.g., a high molecular weight carrier to slow clearance of die aptamer from the circulatory system. 
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a 
cross-lmker. (See. e.g., Brody, E.N. and L. Gold (2000) J. BiotechnoL 74:5-13.) 
30 The term "intramef * refers to an aptamer which is expressed in vivo . For example, a vaccinia 

virus-based RNA expression system has been used to express specific RNA aptamers at high levels 
in die cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Nafl Acad. ScL USA 96:3606-3610). 
The term "spiegelmer** refers to an aptamer which inchxdes L-DNA, L-RNA, or other left- 



19 



wo 02/33099 



PCTAJSOl/47728 



handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 
nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on 
substrates containing right-handed nucleotides. 

The term ''antisense" refers to any composition capable of base-pairing with the "sense" 
S (coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2 -methoxyethyl sugars or 2 -methoxyethoxy sugars; or oligonucleotides having 
modified bases such as S-methyl cytosine, 2*-deoxyuracil, or 7-deaza-2*-deoxyguanosine. Antisense 

10 molecules may be produced by any method including chenucal synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to fonn duplexes which block either transcription or 
translation. The designation '^negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "phis" can refer to the sense strand of a reference DNA molecule. 

15 The tenn '1>iologically active" refers to a protein having structural, regulatory, or biochemical 

functions of a naturally occurring molecule.. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic PKIN, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind widi specific 
antibodies. 

20 "Complementary" describes the relationship between two single-stranded nucleic acid 

sequences that anneal by base-pairing. For example, 5 -AGT-3' pairs with its complement, 
3*-TCA-5'. 

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 

25 amino acid sequence. The composition may comprise a dry fomiulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding PKIN or firagments of PKIN may be 
employed as hybridization probes. The probes may be stored in Ireeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 

30 SDS), and other components (e.g., Denhardt*s solution, dry milk, salmon spenn DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to 
repeated DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5* and/or the 3* direction, and resequenced, or which has been 
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assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a computer 
program for fragment assembly, such as flie GELVIEW fragment assembly system (GCG, Madison 
WI) or Phrap (University of Washington, Seattle WA). Some sequences have been both extended 
and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which may be substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. 



Original Residue 


Conservative Substitution 


Ala 


Gly, Ser 


Arg 


His, Lys 


Asn 


Asp, Gin, His 


Asp 


Asn, Glu 


Cys 


Ala, Ser 


Gin 


Asn, Ght, His 


Ghj 


Asp, Gin, His 


Gly 


Ala 


His 


Asn, Arg, Gin, Glu 


ne 


Leu, Val 


Leu 


He, Val 


Lys 


Arg, Gin, Ghi 


Met 


Leu, lie 


Phe 


His, Met, Leu, Trp, Tyr 


Ser 


Cys, Thr 


Thr 


Ser, Val 


Trp 


Phe, Tyr 


Tyr 


His, Phe, Trp 


Val 


lie. Leu, Thr 



Conservative amino acid substitutions generally nuiint^ (a) the stmcture of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological fimction of the natural molecule. A derivative polypeptide is 



21 



wo 02/33099 



PCT/USOl/47728 



one modified by glycosylation, pegylation, or any similar process that retains at least one biological or 
immunological function of the polypeptide from which it was derived. 

A ""detectable label** refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 
S ''Differential expression" refers to increased or upregulated; or decreased, downregulated, or 

absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, for example, a treated and an untreated sample, or a 
diseased and a normal sample. 

''Exon shuffling** refers to the recombination of different coding regions (exons). Since an 

10 exon may represent a stractural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 

A '"fragment" is a unique portion of PKD^ or the polynucleotide encoding PKIN which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 

15 to the entire lengdi of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, IS, 
16, 20» 25, 30, 40, SO, 60, 7S, 100, ISO, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 

20 example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any lengOi that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

25 A fragment of SEQ ID NO:23-44 comprises a region of unique polynucleotide sequence that 

specifically identifies SEQ ID NO:23-44, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:23-44 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish SEQ 
ID NO:23-44 from related polynucleotide sequences. The precise length of a fragment of SEQ ID 

30 NO:23-44 and the region of SEQ ID NO:23-44 to which the fragment corresponds are routinely 
determmable by one of ordinary skill in the art based on the intended purpose for the firagment. 

A fragment of SEQ ID NO:l-22 is encoded by a fragment of SEQ ID NO:23-44. A 
fragment of SEQ ID NO: 1-22 comprises a region of unique amino acid sequence that specifically 

■ 
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identifies SEQ ID NO:l-22. For example, a fragment of SEQ ID NO:l-22 is useful as an 
immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO:l-22. 
The precise length of a fragment of SEQ ID NO: 1-22 and the region of SEQ ID NO: 1-22 to which 
the fragment corresponds are routinely determinable by one of ordinary skill in the art based on the 
5 intended purpose for the fragment 

A '"fiiU length" polynucleotide sequence is one containing at least a translation initiation codon 
(e.g., methionine) followed by an open reading frame and a translation termination codon. A ''fiill 
length" polynucleotide sequence encodes a ''fuU length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 

10 or more polynucleotide sequences or two or more polypeptide sequences. 

The teruGiS "percent identity^" and "% identity," as applied to polynucleotide sequences, refer to 
the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algoritbnL Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 

15 achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program. This program is part of the LASERGENE software package, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 

20 Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 
8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as 
follows: Ktuple=2, gap penalty^5, window=4, and "diagonals saved"=4. The '"weighted" residue 
weight table is selected as the default Percent identic is reported by CLUSTAL V as the ''percent 
similari^' between aligned polynucleotide sequences. 

25 Alternatively, a suite of conmionly used and freely available sequence comparison algoridmis 

is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment 
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. BioL 215:403-410), which is available from 
several sources, including the NCBI, Bethesda, MD, and on die Internet at 

http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
30 programs including ''blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called ''BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The 
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'"BLAST 2 Sequences" tool cau be used for both blastn and blastp (discussed below). BLAST 
programs are conunonly used with gap and other parameters set to defauk settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "'BLAST 2 Sequences'' tool Version 
2.0.12 (April-2 1-2000) set at default parameters. Such default parameters may be, for example: 
5 Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 

Gap X drop-off: 50 
10 Expect: 10 

Word Size: 11 

Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 

IS over die length of a fragment taken fix>m a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any firagment length supported 
by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a 
length over which percentage identity may be measured. 

20 Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 

similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes . 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantiaUy the same protein. 

The phrases "percent identity* and "% identity," as applied to polypeptide sequences, refer to 

25 the percentage of residue matches between at least two polypeptide sequences aligned using a 

standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the charge and.hydrophobicity at the site of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 

30 Percent identity between polypeptide sequences may be determined using the default 

parameters of the CLUSTAL V algorithm as incorporated into the MEGAUGN version 3. 12e 
sequence alignment program (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
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penaltjteS, window=5, and "diagonals savcd"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity"' between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
5 comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences** tool Version 
2.0. 12 (April-2 1-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BWSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
10 Gap X drop-off: 50 

Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 

15 for example, as defined by a particular SEQ ID number, or may be measured over a shoiter length, 
for example, over the length of a fragment taken firom a larger, defined polypeptide sequence, for 
mstance, a Augment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, m the tables, figures or Sequence Listing, may be 

20 used to describe a length over which percentage identity may be measured. 

'Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term 'Tiumanized antibody" refers to an antibody molecule in which the amino acid 

25 sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

^Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 

30 Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the 
stringency of the hybridization process^ with more stringent conditions aUowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 
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conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in 
the ait and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68^C in the presence of about 6 x SSC, about 
5 1% (w/v) SDS, and about 100 /xg/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to die temperature 
under which the wash step is carried out Such wash temperatures are typically selected to be about 
5^ to 20^ lower than the thermal melting point (T^ for the specific sequence at a defined ionic 
strength and pH. The T„ is the temperature (under defined ionic strength and pH) at which 50% of 

10 the target sequence hybridizes to a perfectly matched probe. An equation for calculating T„ and 

conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. et al. (1989) 
Molecular rinninp;! A Laboratory Manual . 2^ ed., voL 1-3, Cold Spring Harbor Press, Plainview NY; 
specifically see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 

15 inchide wash conditions of 68^C in the presence of about 0.2 x SSC and about 0. 1% SDS, for 1 hour. 
Alternatively, temperatures of about 65*'C, 60^C, 55**C, or 42*'C may be used. SSC concentration may 
be varied fit>m about 0. 1 to 2 x SSC, with SDS being present at about 0. 1%. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for iustance, 
sheared and denatured salmon sperm DNA at about 100-200 /ig/ml. Organic solvent, such as 

20 formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RNArDNA hybridizations. Useful variations on these wash conditions wiU be readily 
apparent to those of ordinary skill in the art Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary siinilarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

25 The term 'liybridization complex" refers to a complex formed between two nucleic acid 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g.. Cot or Rot analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 

30 to which cells or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 
"Immune response" can refer to conditions associated with inflammation, trauma, immune 
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disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 

cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of PKIN which is 
5 capable of eliciting an immune response when introduced into a living organism, for example, a 

mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of 
PKIN which is useful in any of the antibody production methods disclosed herein or known in the art 

The term "microarra/' refers to an arrangement of a phirality of polynucleotides, 
polypeptides, or other chemical compounds on a substrate. 
10 The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 

chemical compound having a unique and defined position on a microarray. 

The term **modulate" refers to a change in die activity of PKIN. For example, modulation 
may cause an increase or a decrease in protein activity, bmding characteristics, or any other biological, 
functional, or immunological properties of PHN. 
15 The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 

polynucleotide, or any fragment thereof. These phrases also refer to DN A or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA). or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
20 fiinctional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
Imked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

'Teptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
25 comprises an oligonucleotide of at least about 5 nucleotides in lengdi linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers sohibility to the compositioiL PNAs 
preferentiaUy bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the celL 

"Post-translational modification" of an PKIN may involve lipidation, glycosylation, 
30 phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
art These processes may occur synthetically or biochemically. Biochemical modifications will vary 
by cell type depending on the enzymatic milieu of PKIN. 

'Trobe" refers to nucleic acid sequences encoding PKIN, their complements, or fragments 
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thereof » which are used to detect identical, allelic or related nucleic acid sequences. Ptobes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers** 
are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 
S polynucleotide by complementary base-pairing. Hie primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PGR). 

Probes and primers as used in the present invention typically comprise at least IS contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 

10 be employed, such as probes and primers fliat comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 

15 example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual 2^ ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, P.M. et al. (1987) Current Protocols in Molecular 
Biology, Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et aL (1990) PCR 
Protocols. A Guide to Methods and Applications . Academic Press, San Diego CA. PCR primer pairs 
can be derived from a known sequence, for example, by using computer programs intended for that 

20 purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is usefiil for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 

25 nucleotides fix>m an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incotporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas 
South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase 
sequences and is thus usefiil for designing primers on a genome-wide scope. The PrimerS primer 

30 selection program (available to the public from the Whitehead Institute/MIT Center for Genome 
Research, Cambridge MA) allows the user to input a "mispriming library," in which sequences to 
avoid as primer binding sites are user-specified. PrimerS is useful, in particular, for the selection of 
oligonucleotides for microarrays. (The source code for the latter two primer selection programs may 
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also be obtained from their respective sources and modified to meet the user's specific needs.) The 
PrimeGen program (available to the pubUc fi-om the UK Human Genome Mapping Project Resource 
Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing 
selection of primers that hybridize to eitiier the most conserved or least conserved regions of aligned 
5 nucleic acid sequences. Hence, diis program is useful for identification of both unique and conserved 
oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments 
identified by any of die above selection methods are useful in hybridization technologies, for example, 
as PGR or sequencing primers, microarray elements, or specific probes to identify fully or partially 
complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are 

10 not limited to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by die 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engmeering techniques 

15 such as those described in Sambrook, supra . The term recombinant includes nucleic acids fliat have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may mchde a nucleic acid sequence operably linked to a promoter sequence. 
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell 
Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 

20 vaccinia virus, diat could be use to vaccinate a manmial wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in die mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5* and 3' untranslated regions 
(UTRs). Regulatory elements mteract widi host or viral proteins which control transcription, 

25 translation, or RNA stability. 

•^Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemihimmescent, or chromogenic agents; substrates; cefaclors; mhibitors; magnetic particles; and 

other moieties known in the art. 
30 An **RNA equivalent," in reference to a DNA sequence, is composed of die same Imear 

sequence of nucleotides as the reference DNA sequence with die exception diat all occurrences of 
the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 
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The term '"sample" is used in its broadest sense. A sample suspected of containing PKIN, 
nucleic acids encoding PIGN, or fragments thereof may comprise a bodily fluid; an extract from a cell, 
chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA, in 
solution or bound to a substrate; a tissue; a tissue print; etc. 
5 The terms '"specific binding" and ""specifically binding*' refer to that interaction between a 

protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 
example, if an antibody is specific for epitope ""A," the presence of a polypeptide comprising the 
10 epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A and the 
antibody will reduce the amount of labeled A diat binds to the antibody. 

The term ""substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with 
15 which they are naturally associated. 

A ""substitution" refers to the replacement of one or more amino acid residues or nucleotides 
by different amino acid residues or nucleotides, respectively. 

""Substrate" refers to any suitable rigid or semi-rigid support inctudiug membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
20 microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A ""transcript image" refers to the collective pattern of gene expression by a particular ceU 
type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
25 cell. Transformation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The metiiod for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or viral 
infection, electroporation, heat shock, lipofection, and particle bombardment. The term ""transformed 
30 cells" includes stably transformed cells in which the inserted DNA is capable of replication either as 
an autonomously replicating plasmid or as part of the host chromosome, as well as transiently 
transformed cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism,*' as used herein, is any organism, including but not limited to animals 
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and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 
mtroduced by way of human intervention, such as by transgenic techniques well known in the art. The 
nucleic acid is introduced into the cell, directly or indirectly by mtroduction into a precursor of the cell, 
by way of deliberate genetic manipulatipn, sudi as by microinjection or by infection with a 

5 recombmant virus. The term genetic manipulation does not include classical cross-breeding, or in vitro 
fertilization, but rather is directed to the mtroduction of a recombinant DNA molecule. The transgenic 
organisms contemplated in accordance with the present invention inchide bacteria, cyanobacteria, 
fiingi, plants and animals. The isolated DNA of the present invention can be introduced into the host 
by methods known in the art, for example infection, transfection, transformation or transconjugation. 

10 Techniques for transferring the DNA of the present invention into such organisms are widely known 
and provided in references such as Sambrook et al. (1989), supra . 

A **variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the '*BLAST 2 Sequences" tool Version 2.0.9 (May-07- 

15 1999) set at defauk parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "splice," "species," or "polymorphic" variant A splice variant may have 

20 significant identity to a reference molecule, but will generally have a greater or lesser number of 
polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides will generally have significant amino acid identity relative to each 

25 other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 

between individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 

propensity for a disease state. 
30 A **variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 

at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
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least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 93%, at least 
94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence 
identity over a certain defined lengdi of one of the polypeptides. 

5 THE INVENTION 

The invention is based on the discovery of new human human kinases (PKIN), the 
polynucleotides encoding PKIN, and the use of these compositions for the diagnosis, treatment, or 
prevention of cancer, immune disorders, disorders affecting growth and development, cardiovascular 
diseases, and lipid disorders. 

10 Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 

sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to a 
single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is denoted 
by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an Incyte 
polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide sequence is 

IS denoted by bodi a polynucleotide sequence identification number (Polynucleotide SEQ ID NO:) and an 
Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown. 

Table 2 shows sequences with homology to the polypeptides of the invention as identified by 
BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the 
polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding Incyte 

20 polypeptide sequence number (Incyte Polypeptide ID) for polypeptides of the invention. Column 3 
shows the GenBank identification number (Genbank ID NO:) of the nearest GenBank homolog. . 
Column 4 shows the probability score for the match between each polypeptide and its GenBank 
homolog. Column S shows the annotation of the GenBank homolog along with relevant citations 
where applicable, all of which are expressly incorporated by reference herein. 

25 Table 3 shows various stmctural features of the polypeptides of the invention. Columns 1 and 

2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding Incyte 
polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. Column 

3 shows the number of amino acid residues in each polypeptide. Colunm 4 shows potential 
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the MOTIFS 

30 program of the GCG sequence analysis software package (Genetics Computer Group, Madison WI). 
Column 6 shows amino acid residues comprising signature sequences, domains, and motifs. Colunm 7 
shows analytical methods for protein structure/function analysis and in some cases, searchable 
databases to which the analytical methods were applied. 

32 



wo 02/33099 




PCT/USOl/47728 



Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
properties establish that the claimed polypeptides are human kinases. 

For example, SEQ ID NO:l is 91% identical to human casein kinase I-alpha (GenBank ID 
g852055) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The 

S BLAST probability score is 2.9e-167» which indicates the probability of obtaining the observed 
polypeptide sequence alignment by chance. SEQ ID NO:l also contains a eukaryotic protein kinase 
domain as determined by searching for statistically significant matches in the hidden Markov model 
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from 
BLIMPS, MOTIFS, and PROEDLESCAN analyses provide further corroborative evidence that SEQ 

10 ID N0:1 is a protein kinase. 

For example, SEQ ID NO: 10 is 91% identical to Mus muscuhis FYVE finger-containing 
phosphoinositide kinase (GenBank ID g4200446) as determined by the Basic Local Alignment Search 
Tool (BLAST). (See Table 2.) The BLAST probability score is 0.0, which indicates the probability of 
obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO: 10 also contains a 

IS phosphatidyl inositol 4-phosphate S-kinase domain as determined by searching for statistically 

significant matches in the hidden Markov model (HMM)-based PFAM database of conserved protein 
family domains. (See Table 3.) Data from PRODOM analysis provides further corroborative 
evidence that SEQ ID NO: 10 is a phosphoinositide kinase. 

For example, SEQ ID NO: 12 is 71% identical to human seiine/tfareonine protein kinase 

20 (GenBank ID g7 160989) as determined by the Basic Local Aligmnent Search Tool (BLAST). (See 
Table 2.) The BLAST probability score is 1.7e-148, which indicates the probability of obtaining the 
observed polypeptide sequence alignment by chance. SEQ ID NO: 12 also contains a eukaryotic 
protein kinase domain as determined by searching for statistically significant matches in the hidden 
Markov model (HMM)-based PFAM database of conserved protein family domains. (See Table 3.) 

25 Data from BLIMPS and MOTIFS analyses provide further corroborative evidence that SEQ ID 
N0:12 is protein kinase. 

For example, SEQ ID NO: 13 is 86% identical to murine pantothenate kinase 1 beta (GenBank 
ID g6690020) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) 
The BLAST probability score is 1.6e-129, which indicates the probabiliQr of obtaining the observed 

30 polypeptide sequence alignment by chance. Pantothenate kinase (PanK) is proposed to be the master 
regulator of CoA biosynthesis in mammalian cells, by controlling flux through the CoA biosynihetic 
pathway. Changes ia the level of tissue PanK activitiy is reflected by the concurrent changes in the 
levels of CoA as seen in various metabolic states. Alterations in CoA levels and PanK activity are 
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seen during starvation/feeding, pathological states such as diabetes and by treatment with 
hypolipidemic drags (Rock, CO. et al.. (2000) J. Biol Chem. 275:1377-1383.) 

For example, SEQ ID NO: 16 is 68% identical to Mus muscuhis Nck-interacting kinase-Uke 
embryo specific kinase (GenBank ID g6472874) as determined by the Basic Local Alignment Search 
S Tool (BLAST). (See Table 2.) The BLAST probability score is 0.0, which indicates the probability of 
obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO: 16 also contains a 
eukaryotic protein kinase domain as determined by searching for statistically significant matches in the 
hidden Markov model (HMM)-based PFAM database of conserved protein family domains. (See 
Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide fiirther 

10 corroborative evidence that SEQ ID NO: 16 is a protein kinase. 

For example, SEQ ID NO: 19 is 99% identical to human RET tyrosine kinase receptor 
(GenBank ID g5419753) as determined by the Basic Local Alignment Search Tool (BLAST). (See 
Table 2.) The BLAST probability score is 0.0, which indicates the probability of obtaining the 
observed polypeptide sequence alignment by chance. SEQ ID NO: 19 also contains a eukaryotic 

15 protein kinase domain as determined by searching for statistically significant matches in the hidden 
Markov model (HMM)-based PFAM database of conserved protein family domains. (See Table 3.) 
Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence 
that SEQ ID NO: 19 is a tyrosme kinase. * 

For example, SEQ ID NO:22 is 33% identical to Gallus gallus smoofli muscle myosin light 

20 chain kinase precursor (GenBank ID g2 12661) as determined by the Basic Local Alignment Search 
Tool (BLAST). (See Table 2.) The BLAST probabiUty score is L2 e.60, which indicates the 
probability of obtaining ihe observed polypeptide sequence alignment by chance. SEQ ID NO:22 also 
contains two eukaryotic protein kinase domains as determined by searching for statistically significant 
matches in the hidden Markov model (HMM)-based PFAM database of conserved protein family 

25 domains. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide 
fiirther corroborative evidence that SEQ ID NO:22 is a protein kinase. 

SEQ ID NO:2-9, SEQ ID NO:ll, SEQ ID NO:14-15, SEQ ID NO:17-18, and SEQ ID 
NO:20-21 were analyzed and annotated in a similar manner. The algorithms and parameters for the 
analysis of SEQ ID NO: 1-22 are described in Table 7. 

30 As shown in Table 4, the fiiU length polynucleotide sequences of the present invention were 

assembled using cDNA sequences or coding (exon) sequences derived fi'om genomic DNA, or any 
combination of these two types of sequences. Columns 1 and 2 list the polynucleotide sequence 
identification number (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide 
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consensus sequence number (Incyte Polynucleotide ID) for each polynucleotide of the invention. 
Column 3 shows the length of each polynucleotide sequence in basepairs. Column 4 lists fragments of 
the polynucleotide sequences which are useful, for example, in hybridization or amplification 
technologies that identify SEQ ID NO:23-44 or that distinguish between SEQ ID NO:23-44 and 
5 related polynucleotide sequences. Cohinm 5 shows identification numbers corresponding to cDNA 
sequences, coding sequences (exons) predicted firom genomic DNA, and/or sequence assemblages 
comprised of both cDNA and genomic DNA. These sequences were used to assemble the full length 
polynucleotide sequences of the invention. Cobinms 6 and 7 of Table 4 show the nucleotide start (5') 
and stop (3') positions of tiie cDNA and/or genomic sequences in colunm 5 relative to their respective 

10 fuU length sequences. 

The identification numbers in Column 5 of Table 4 may refer specifically, for example, to 
Incyte cDNAs along with their corresponding cDNA libraries. For example, 183812R7 is the 
identification number of an Incyte cDNA sequence, and CARDNOTOl is the cDNA library from 
which it is derived. Incyte cDNAs for which cDNA libraries are not indicated were derived from 

15 pooled cDNA libraries (e.g., 7 15 83296V 1). Alternatively, die identification numbers in colunm 5 may 
refer to GenBank cDNAs or ESTs which contributed to the assembly of the full length polynucleotide 
sequences. In addition, the identification numbers in colunm 5 may identify sequences derived from 
the ENSEMBL (The Sanger Centre, Cambridge, UK) database (/.«., tiiose sequences including the 
designation "ENST')- Alternatively, the identification numbers in column 5 maybe derived fit)m the 

20 NCBI RefSeq Nucleotide Sequence Records Database (/.e., those sequences mchiding the 

designation **NM" or '*NTO or the NCBI RefSeq Protein Sequence Records (i.e., fliose sequences 
inchiding the designation **NF'). Alternatively, the identification numbers in colunm 5 may refer to 
assemblages of both cDNA and Genscan-predicted exons brought together by an "exon stitching" 
algoritimi. For example, ¥LJOOQO(XJfiJl2J^^^YYY_N^N^ represents a "stitched" sequence in 

25 which XXXXXX is the identification number of the cluster of sequences to which the algorithm was 
applied, and YYYYY is the number of the prediction generated by the algoritiun, and if present, 
represent specific exons that may have been manually edited during analysis (See Example V). 
Alternatively, the identification numbers in column 5 may refer to assemblages of exons brought 
togeflier by an "exon-stretching" algoritimL For example, ¥LKXXXXK_s/iAAAA^BBBB_[Ji is the 

30 identification number of a "stretched" sequence, wifli XXXXXX being the Incyte project identification 
number, gAAAAA being the GenBank identification number of the human genomic sequence to which 
the "exon-stretching" algorithm was applied, gBBBBB being the GenBank identification number or 
NCBI RefSeq identification number of the nearest (jenBank protein homolog, and N referring to 
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specific exons (See Example V). In instances where a RefSeq sequence was used as a protein 
homolog for the "exon-stretching" algorithm, a RefSeq identifier (denoted by ^'NM," "NP," or "NT*) 
may be used in place of the GenBank identifier (Le. , gfiBBBB). 

Alternatively, a prefix identifies component sequences that were hand-edited, predicted firom 
genomic DNA sequences, or derived from a combination of sequence analysis methods. The 
following Table lists examples of component sequence prefixes and corresponding sequence analysis 
methods associated with the prefixes (see Example IV and Example V). 



Prefix 


Type of analysis and/or examples of programs 


GNN. GFG, 
ENST 


Exon prediction from genomic sequences using, for example, 
GENSCAN (Stanford University, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK) 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Example V). 


INCY 


Full length transcript and exon prediction from mapping of EST 
sequences to the genome. Genomic location and EST composition 
data are combined to predict the exons and resulting transcript. 



In some cases, Incyte cDNA coverage redundant with tfie sequence coverage shown in 
column 5 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Incyte 
cDNA identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotide 
sequences which were assembled using Incyte cDNA sequences. The representative cDNA library 
is the Incyte cDNA library which is most frequently represented by the Incyte cDNA sequences 
which were used to assemble and confirm the above polynucleotide sequences. The tissues and 
vectors which were used to construct the cDNA libraries shown in Table S are described in Table 6. 

Tbe invention also encompasses PKIN variants. A preferred PKIN variant is one which has 
at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the PKIN amino acid sequence, and which contains at least one fimctional or 
structural characteristic of PKIN. 

The invention also encompasses polynucleotides which encode PKIN. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:23-44, which encodes PKIN. The polynucleotide sequences 
of SEQ ID NO:23-44, as presented in the Sequence Listing, embrace the equivalent RNA sequences, 
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wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar 
backbone is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding PKIN. In 
particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least 

S about 85%, or even at least about 93% polynucleotide sequence identity to the polynucleotide 
sequence encoding PKIN. A particular aspect of the invention encompasses a variant of a 
polynucleotide sequence coniprising a sequence selected from the group consisting of S£Q ID NO:23- 
44 which has at least about 70%, or altemadvely at least about 85%, or even at least about 95% 
polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting of SEQ 

10 ID NO:23-44, Any one of the polynucleotide variants described above can encode an amino acid 
sequence which contains at least one functional or structural characteristic of PKIN. 

It will be appreciated by those skilled in die art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding PKIN, some bearing minimal similarity 
to the polynucleotide sequences of any known and naturally occurring gene, maybe produced. Thus, 

15 the invention contemplates each and every possible variation of polynucleotide sequence that could be 
made by selecting combinations based on possible codon choices. These combinations are made in 
accordance with die standard triplet genedc code as applied to the polynucleotide sequence of 
naturally occurring PKIN, and all such variations are to be considered as being specifically disclosed. 
Althougih nucleotide sequences which encode PKIN and its variants are generally capable of 

20 hybridizing to the nucleotide sequence of the naturally occurring PKIN under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding PKIN or 
its derivatives possessing a substantiaUy different codon usage, e.g., inclusion of non-naturally 
occurring codons. Codons may be selected to increase the rate at which expression of the peptide 
occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which 

25 particular codons are utilized by the host Other reasons for substantially altering the nucleotide 
sequence encoding PKIN and its derivatives widiout altering the encoded amino acid sequences 
include the production of RNA transcripts having more desirable properties, such as a greater half-life, 
than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode PKIN and 

30 PKIN derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 

synthetic sequence may be inserted into any of the many available expression vectors and cell systems 
using reagents well known in the art Moreover, synthetic chemistry may be used to introduce 
mutations into a sequence encoding PKIN or any fragment thereof. 
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Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:23-44 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods EnzymoL 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507- 
S. 511.) Hybridization conditions, including annealing and wash conditions, are describe 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as flie Klenow fragment 
of DNA polymerase 1, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 

10 combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE 
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is 
automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV)> 
PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler 
(Applied Biosystems). Sequencing is dien carried out using either the ABI 373 or 377 DNA 

IS sequencing system (Applied Biosystems), the MEGABACE 1000 DNA sequencing system 

(Molecular Dynamics, Sunnyvale CA), or other systems known in the art The resulting sequences 
are analyzed using a variety of algorithms which are well known in the art (See, e.g., Ausubel, P.M. 
(1997) Short Protocols in Molecular Bioloev , John Wiley & Sons, New York NY, unit 7.7; Meyers, 
R.A. (1995) Molecular Biology and Biotechnology . Wiley VCH, New York NY, pp. 856-853.) 

20 The nucleic acid sequences encoding PKIN may be extended utilizing a partial nucleotide 

sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See. e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 

25 Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising 
a known genomic locus and surroundmg sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et 

30 al. (1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 
are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 
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Additionally, one may use PGR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo 
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OUGO 4.06 primer analysis software (National 
5 Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, 
to have a GC content of about 50% or more, and to anneal to the template at temperatures of about 
68^C to 72°C. 

When screening for ftiU length cDNAs, it is preferable to use libraries that have been 
size-selected to inchide larger cDNAs. In addition, random-primed libraries, which often include 

10 sequences containing the 5* regions of genes, are preferable for situations in which an oligo d(T) 

library does not yield a ftill-length cDNA. Genomic libraries may be useful for extension of sequence 
into S' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PGR products. In particular, capillary 

15 sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
softwaie (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 

20 controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode PKIN may be cloned in recombinant DNA molecules that direct expression of PKIN, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy 

25 of the genetic code, other DNA sequences which encode substantially the same or a functionally 
equivalent amino acid sequence may be produced and used to express PKIN. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter PKIN-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product DNA 

30 shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 

oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 
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The nucleotides of the present invention may be subjected to DN A shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 
5.837,458; Chang, C.-C. et al. (1999) Nat Biotechnol. 17:793-797; Christians, RC. et al. (1999) Nat. 
BiotechnoL 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or improve 

5 the biological properties of PKIN, such as its biological or enzymatic activity or its ability to bind to 
other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 
selection or screening procedures that identify those gene variants with the desired properties. These 
preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 

10 selection/screening. Thus, genetic diversity is created through '"artificial" breeding and rapid molecular 
evolution. For example, fragments of a single gene containing random point mutations may be 
recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, 
fragments of a given gene may be recombined with fragments of homologous genes in the same gene 
family, either from the same or different species, thereby maximizing the genetic diversity of multiple 

15 naturally occurring genes in a directed and controllable manner. 

In another embodiment, sequences encoding PKIN may be synthesized, in whole or in part, 
using chemical methods weU known in the art. (See, e.g., Camtfaers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; and Horn, T. et aL (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
PKIN itself or a fragment thereof may be synthesized using chemical methods. For example, peptide 

20 synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 

Creighton, T. (1984) Proteins. Structures and Molecular Properties . WH Freeman, New York NY, pp. 
55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis maybe achieved 
using the ABI 431 A peptide synthesizer (Applied Biosystems). Additionally, the amino acid sequence 
of PKIN, or any part thereof, may be altered during direct synthesis and/or combined with sequences 

25 from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a 
sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and RZ. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 

30 (See, e.g., Creighton, supra, pp. 28-53.) 

In order to express a biologically active PKIN, the nucleotide sequences encoding PKIN or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in 
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a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3 * untranslated regions in the vector and in polynucleotide sequences 
encoding PKIN. Such elements may vary in their strength and specificity. Specific initiation signals 
may also be used to achieve more efficient translation of sequences encodmg PKIN. Such signals 

5 include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 
sequences encodmg PKIN and its initiation codon and upstream regulatory sequences are inserted into 
the appropriate expression vector, no additional transcriptional or translational control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, 
exogenous translational control signals mcludmg an in-frame ATG initiation codon should be provided 

10 by die vector. Exogenous translational elements and initiation codons may be of various origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers 
appropriate for the particular host cell system used. (See, e.g., Scharf, D. et aL (1994) Results Probl. 

Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 

15 vectors containing sequences encodmg PKIN and appropriate transcriptional and translational control 
elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in 
vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Clonmg. A Laboratory 
Manual . Cold Spring Harbor Press. Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) 
Current Protocols m Molecular Bioloev , John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

20 A variety of expression vector/host systems may be utilized to contain and express sequences 

encoding PKIN. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed wifli 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transfoimed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 

25 tobacco mosaic viros, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g., Sambrook, supra : Ausubel, supra ; Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA 
91:3224-3227; Sandig, V. et al. (1996) Huhl Gene Ther. 7:1937-1945; Takamatsu, N. (1987) EMBO 
J. fi!^fl7.^1 1 : The McGraw Mil Yearbook of Science and Technologv (1992) McGraw Ifill, New 

30 York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and 
Hanington, J.J. et aL (1997) Nat. Genet. 15:345-355.) Expression vectors derived from retroviruses, 
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di Nicola, 
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M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. Nad. Acad. Sci. USA 
90(13):6340-6344; Duller, R.M. et al (1985) Nature 317(6040):813-815; McGregor, D.R et aL (1994) 
Mol. Immunol. 31(3):219-226; and Verma, LM. and N. Somia (1997) Nature 389:239-242.) The 
invention is not limited by the host cell employed. 
5 In bacterial systems, a number of cloning and expression vectors may be selected depending 

upon the use intended for polynucleotide sequences encoding PKIN. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding PKIN can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La JoUa CA) or PSPORTl 
plasmid (life Technologies). Ligation of sequences encoding PKIN into the vector's multiple cloning 

10 site disrupts the ladL gene, allowing a colorimetric screening procedure for identification of 

transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence. (See, e.g.. Van Heeke, G. and S.M. Schuster (1989) J. BioL 
Chem. 264:5503-3509.) When large quantities of PKIN are needed, e.g. for the production of 

15 antibodies, vectors which direct high level expression of PKIN may be used. For example, vectors 
containing the strong, inducible SP6 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of PKIN. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 

20 vectors direct either the secretion or intracellular retention of expressed proteins and enable integration 
of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra : 
Bitter, G.A. et al (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. (1994) 
Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of PKIN. Transcription of sequences 

25 encoding PKIN may be driven by viral promoters, e.g., the 35S and 19S promoters of CaM V used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-3 1 1). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. 
(1984) Science 224:838-843; and Winter, J. et aL (1991) Results ProbL Cell Differ. 17:85-105.) These 

30 constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated 

transfection. (See, e.g.. The McGraw HiD Yearbook of Science and Technoloev (1992) McGraw Hill, 
New York NY, pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
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where an adenovirus is used as an expression vector, sequences encoding PKIN may be ligated into 
an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses PKIN in host ceDs. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 

S NatL Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RS V) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 

10 constructed and delivered via conventional delivery mediods (liposomes, polycationic amino polymers, 
or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat Genet. 15:345- 
355.) 

For long term production of recombinant proteins in mammalian systems, stable expression of 
PKIN in cell lines is preferred. For example, sequences encoding PKIN can be transformed into cell 

15 lines using egression vectors which may contain viral origins of replication and/or endogenous 

expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 
being switched to selective media. The purpose of the selectable marker is to confer resistance to a 
selective agent, and its presence allows growth and recovery of cells which successfully express the 

20 introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk and apr cells, respectively. (See, e.g., Wigler, M. et 

25 al. (1977) Cell 11:223-232; Lowy, I. et aL (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or 
herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
methotrexate; neo confers resistance to flie aminoglycosides neomycin and G-418; and als and pat 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 
Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 

30 J. Mol. BioL 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which 
alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
NatL Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins 
(GFP; Clontech), 6 glucuronidase and its substrate B-ghicuronide, or luciferase and its substrate 
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luciferin may be used. These markers can be used not only to identify transfonnants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector system. 
(See, e,g., Rhodes, CA. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest 
S is also present, the presence and expression of the gene may need to be confirmed. For example, if 
the sequence encoding PKIN is inserted within a marker gene sequence, transformed cells containing 
sequences encoding PKIN can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding PKIN under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 

10 expression of the tandem gene as well. 

In general, host cells that contain die nucleic acid sequence encoding PKIN and that express 
PKIN may be identified by a variety of procedures known to those of skill in the art. These 
procedures inchide, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PGR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 

15 chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of PKIN using either 
specific polyclonal or monoclonal antibodies are known in the art Examples of such techniques 
include enzyme-linked immunosoibent assays (EUSAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 

20 monoclonal antibodies reactive to two non-interfering epitopes on PKIN is preferred, but a competitive 
binding assay may be employed. These and other assays are well known in the art. (See, e.g., 
Hampton, R. et al. (1990) Serological Methods, a Laboratorv Manual . APS Press, St Paul MN, Sect 
IV; Coligan, J.E. et aL (1997) Current Protocols in Immunology . Greene Pub. Associates and Wiley- 
Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols . Humana Press, 

25 Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding PKIN include 
oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
30 Alternatively, the sequences encoding PKIN, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in die art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 
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of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for 
ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic 
agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 
5 Host cells transformed with nucleotide sequences encoding PKIN may be cultured under 

conditions suitable for the expression and recovery of the protem from cell culture. The protem 
produced by a transformed cell may be secreted or retained intracelhilarly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing 
polynucleotides which encode PKIN may be designed to contain signal sequences whidi direct 
10 secretion of PKIN through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
flie polypeptide inchide, but are not limited to, acetylation, caiboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the 
15 protein may also be used to specify protein targeting, folding, and/or activity. Different host cells 
which have specific celhilar machinery and characteristic mechanisms for post-translational activities 
(e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from die American Type Culture 
Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and 
processing of the foreign protein. 
20 In anoflier embodiment of the mvention, natural, modified, or recombinant nucleic acid 

sequences encoding PKIN may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of tiie aforementioned host systems. For example, a chimeric PKIN protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate die screening of peptide hliraries for inhibitors of PKIN activity. Heterologous protein and 
25 peptide moieties may also facilitate purification of fusion proteins using commerciaUy available affinity 
matrices. Such moieties inchide, but are not limited to, ghitafliione S-transferase (GST), maltose 
bindmg protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP). 6-His, FLAG, c-myc, and 
hemagghitinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fiision 
proteins on immobilized glutathione, maltose, phenylarsine oxide, caknodulin, and metal-chelate resins, 
30 respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies tiiat specifically recognize 
these epitope tags. A fusion protein may also be engineered to contam a proteolytic cleavage site 
located between the PKIN encoding sequence and the heterologous protein sequence, so that PKIN 
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may be cleaved away from the heterologous moiety following purification. Methods for fusion protein 
expression and purification are discussed in Ausubel (199S, supra , ch. 10). A variety of commercially 
available kits may also be used to facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled PKIN may be achieved in 
S vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 

systems couple transcription and translation of protein-coding sequences operably associated with the 
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, ^^S-methionine. 

PKIN of the present invention or fragments thereof may be used to screen for compounds 

10 that specifically bind to PKIN. At least one and up to a phirality of test compounds may be screened 
for specific binding to PKIN. Examples of test compounds include antibodies, oligonucleotides, 
proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
PKIN, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 

15 natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to Ate natural receptor to which PKIN 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which e?^ress PKIN, either as a secreted 

20 protein or on the ceU membrane. Preferred cells include ceUs from mammals, yeast, Drosophila , or E. 
coli . Cells expressing PKIN or cell membrane fractions which contain PKIN are then contacted with 
a test compound and binding, stimulation, or inhibition of activity of either PKIN or the compound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
25 detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label For example, the 
assay may comprise the steps of combining at least one test compound with PKIN, either in solution 
or affixed to a solid support, and detecting the binding of PKIN to the compound. Altematively, the 
assay may detect or measure binding of a test compound in the presence of a labeled competitor. 
Additionally, the assay may be carried out using cell-free preparations, chemical libraries, or natural 
30 product mixtures, and the test compound(s) may be free in solution or affixed to a solid support. 

PKIN of the present invention or fragments thereof may be used to screen for compounds 
that modulate the activity of PKIN. Such compounds may include agonists, antagonists, or partial or 
inverse agonists. In one embodiment, an assay is performed under conditions permissive for PKIN 

a 

46 



wo 02/33099 



PCTAJSOl/47728 



activity, wherein PKIN is combined with at least one test compound, and the activity of PKIN in the 
presence of a test compound is compared with the activity of PKIN in flie absence of the test 
compound. A change in the activity of PKIN in the presence of the test compound is indicative of a 
compound that modulates flie activity of PKIN. Alternatively, a test compound is combined with an in 
5 vitro or cell-free system comprismg PKIN under conditions suitable for PKIN activity, and the assay 
is performed. In either of these assays, a test compound which modulates flie activity of PKIN may 
do so indirectly and need not come in direct contact with the test compound. At least one and up to a 
plurality of test conq>ound5 may be screened. 

In anoAer embodiment, polynucleotides encoding PKIN or their mammalian homologs may be 
10 *Taiocked out" in an anunal model system using homologous recombmation in embryonic stem (ES) 
cells. Such techniques are well known in the art and are useful for the generation of ammal models of 
human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For example, 
mouse ES cells, such as the mouse 129/SvJ ceU line, are derived from the early mouse embryo and 
grown in culture. The ES cells are transformed with a vector contaming the gene of interest disrupted 
15 by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) Science 
244:1288-1292). The vector integrates into the corresponding region of the host genome by 
homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP 
system to knockout a gene of interest in a tissue- or developmental stage-specific manner (Marth, J.D. 
(1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330). 
20 Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those fix)m 
the C57BL/6 mouse straiiL The blastocysts are surgically transferred to pseudopregnant dams, and 
the resulting chimeric progeny are genotyped and brejd to produce heterozygous or homozygous 
strains. Transgenic ammals thus generated may be tested with potential therapeutic or toxic agents. 
Polynucleotides encoding PKIN may also be manipulated in vitro in ES cells derived fix)m 
25 human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages mcluding endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J. A. et aL 
(1998) Science 282:1145-1147). 

Polynucleotides encoding PKIN can also be used to create **knockin" humanized animals 
30 (pigs) or transgenic animals (mice or rats) to model human disease. WTith knockin technology, a region 
of a polynucleotide encoding PKIN is injected into ammal ES cells, and die injected sequence 
integrates into tiie animal cell genome. Transformed cells are mjected into blastulae, and the blastulae 
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with 
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potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively, 
a mammal inbred to overexpress PKIN» e.g., by secreting PKIN in its milk, may also serve as a 
convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 

THERAPEUTICS 

5 Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between 

regions of PKIN and human kinases. In addition, the expression of PKIN is closely associated with 
brain, breast tumor, cardiovascular, digestive, fallopian tube tumor, fetal stomach, nervous, ovarian 
tumor, pancreatic tumor, peritoneal tumor, pituitary gland, placental, prostate tumor, neural, spinal cord, 
and testicular tissues, and with umbilical cord blood dendritic cells. Therefore, PKIN appears to play a 
10 role in cancer, immune disorders, disorders affecting growth and development, cardiovascular 

diseases, and lipid disorders. In the treatment of disorders associated with increased PKIN expression 
or activity, it is desirable to decrease the expression or activity of PKIN. In the treatment of disorders 
associated with decreased PKIN expression or activity, it is desirable to increase the expression or 
activity of PKIN. 

IS Therefore, in one embodiment, PKIN or a fragment or derivative thereof may be administered 

to a subject to treat or prevent a disorder associated with decreased expression or activity of PKIN. 
Examples of such disorders include, but are not limited to, a cancer, such as adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of 
the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 

20 gastrointestinal tract, heart, kidney, liver, hmg, muscle, ovary, pancreas, parathyroid, penis, prostate, 
salivary glands, skin, spleen, testis, thymus, thyroid, and uterus, leukemias such as multiple myeloma 
and lymphomas such as Hodgkin's disease; an immune disorder, such as acquired immunodeficiency 
syndrome (AIDS), Addison* s disease, adult respiratory distress syndrome, allergies, ankylosing 
spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune 

25 thyroiditis, autoimmune polyendocrinopatfay-candidiasis-ectodermal dystrophy (APECED), bronchitis, 
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, 
emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, 
atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout. Graves' disease, Hashimoto's 
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 

30 myocardial or pericardial ioflanunation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, 
Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, 
systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, 
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bacterial, fiingal, parasitic, protozoal, and helminthic infections, and trauma; a growth and 
developmental disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers inchiding 

5 adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 

particular, cancers of die adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gallbladder, 
ganglia, gastrointestinal tract, heart, kidney, liver, hing, muscle, ovary, pancreas, parathyroid, penis, 
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus, renal tubular acidosis, anemia, 
Cushing^s syndrome, achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, 

10 gonadal dysgenesis, WAGR syndrome (Wihns' tumor, aniridia, genitourinary abnormalities, and mental 
retardation), Smith-Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial dysplasia, 
hereditary keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and 
neurofibromatosis, hypothyroidism, hydrocephalus, seizure disorders such as Syndenham*s chorea and 
cerebral palsy, spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, and 

15 sensorineural hearing loss; a cardiovascular disease, such as arteriovenous fistula, atherosclerosis, 
hypertension, vasculitis, Raynaud*s disease, aneurysn:is, arterial dissections, varicose veins, 
thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon 
angioplasty, vascular replacement, and coronary artery bypass graft surgery, congestive heart failure, 
ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, 

20 degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, 
mitral annular calcification, mitral valve prolapse, rheumatic fever and iheumatic heart disease, 
infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic hipus 
erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
disease, congenital heart disease, and complications of cardiac transplantation, congenital lung 

25 anomalies, atelectasis, pulmonary congestion and edema, pulmonary embolism, puhnonary hemonhage, 
puhnonary infarction, pulmonary hypertension, vascular sclerosis, obstractive pulmonary disease, 
restrictive pulmonary disease, chronic obstructive pulmonary disease, emphysema, chronic bronchitis, 
bronchial asthma, bronchiectasis, bacterial pneumonia, viral and mycoplasmal pneumonia, lung 
abscess, pulmonary tuberculosis, diffuse interstitial diseases, pneumoconioses, sarcoidosis, idiopathic 

30 pulmonary fibrosis, desquamative interstitial pneumonitis, hypersensitivity pneumonitis, pulmonary 
eosinophilia bronchiolitis obliterans-organizmg pneumonia, diffuse puhnonary hemonhage syndromes, 
Goodpasture's syndromes, idiopathic pulmonary hemosiderosis, pulmonary involvement in 
collagen- vascular disorders, pulmonary alveolar proteinosis, hing tumors, inflammatory and 
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noninflammatory pleural effusions, pneumothorax, pleural tumors, drug-induced lung disease, radiation- 
induced hing disease, and complications of lung transplantation; and a lipid disorder such as fatty liver, 
cholestasis, primary biliary cinhosis, carnitine deficiency, carnitine palmitoyltransferase deficiency, 
myoadenylate deaminase deficiency, hypertriglyceridemia, lipid storage disorders such Fabry's 
S disease, Gaucher* s disease, Niemann-Pick's disease, metachromatic leukodystrophy, 

adrenoleukodystrophy, GM2 gangliosidosis, and ceroid lipofuscinosis, abetalipoproteinemia, Tangier 
disease, hyperlipoproteinemia, diabetes mellitus, lipodystrophy, lipomatoses, acute panniculitis, 
disseminated fat necrosis, adiposis dolorosa, lipoid adrenal hyperplasia, minimal change disease, 
lipomas, atherosclerosis, hypercholesterolemia, hypercholesterolemia with hypertriglyceridemia, 

10 primary hypoalphalipoproteinemia, hypothyroidism, renal disease, liver disease, lecithinxholesterol 
acyltransferase deficiency, cerebrotendinous xanthomatosis, sitosterolemia, hypocholesterolemia, Tay* 
Sachs disease, Sandhoff s disease, hyperlipidemia, hyperlipemia, lipid myopathies, and obesity. 

In another embodiment, a vector capable of expressing PKIN or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 

IS expression or activity of PKIN including, but not limited to, diose described above. 

In a further embodiment, a composition comprising a substantially purified PKIN in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent 
a disorder associated with decreased eiqiression or activity of PKIN including, but not limited to, those 
provided above. 

20 In still another embodiment, an agonist which modulates the activity of PKIN may be 

administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of PKIN including, but not limited to, those listed above. 

In a further embodiment, an antagonist of PKIN may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of PKIN. Examples of such 

25 disorders include, but are not limited to, those cancer, inunune disorders, disorders affecting growth 
and development, cardiovascular diseases, and lipid disorders described above. In one aspect, an 
antibody which specifically binds PKIN may be used directly as an antagonist or indirectly as a 
targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissues which express 
PKIN. 

30 In an additional embodiment, a vector expressing the complement of the polynucleotide 

encoding PKIN may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of PKIN including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
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sequences, or vectors of flie invention may be administered in combination witti other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination tiierapy may be made 
by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 
combination of therapeutic agents may act synergistically to effect die treatment or prevention of the 

5 various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of PKIN may be produced using metiiods which are generally known in the art 
In particular, purified PKIN may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bmd PKIN. Antibodies to PKIN may also 

10 be generated using methods that are well known in the art. Such antibodies may include, but are not 
limited to, polyclonal, monoclonal, chimeric, and single chain antibodies. Fab fragments, and fragments 
produced by a Fab expression library. Neutralizing antibodies (i.e., those which mhibit dimer 
formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, and 

15 others may be immunized by injection with PKIN or with any fragment or oligopeptide thereof which 
has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels 
such as alummum hydroxide, and surface active substances such as lysolecithin, phironic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 

20 (bacilli Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to PKIN 
have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at 
least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are 
identical to a portion of the amino acid sequence of flie natural protein. Short stretches of PKIN 

25 amino acids may be fused with tiiose of another protem, such as KLH, and antibodies to the chimeric 

molecule may be produced. 

Monoclonal antibodies to PKIN may be prepared usmg any technique whidi provides for the 
production of antibody molecules by continuous cell lines m culture. These include, but are not limited 
to, die hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
30 technique. (See, e.g., Kohler. G. et aL (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 

Immunol Methods 81:31-42; Cote, R.J. et aL (1983) Proc. NatL Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell BioL 62:109-120.) 

In addition, techniques developed for die production of "chhneric antibodies," such as die 
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splicmg of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
NaU. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 

S chain antibodies may be adapted, using methods known in the art, to produce PKIN-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g.. Burton, 
D.R. (1991) Proc. Nad. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 

10 or by screening inununoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the literature. (See, e.g., Qrlandi, R. et al. (1989) Proc. Nafl. Acad. Sci. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for PKIN may also be generated. 
For example, such fragments include, but are not limited to, F(eb\ fragments produced by pepsin 

IS digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
ttie F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments wilh the desired specificity. (See, e.g., Huse, W.D. 
et aL (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screemng to identify antibodies having the desired 

20 specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art Such 
immunoassays typically involve the measurement of complex formation between PKIN and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive 
to two non-interfering PKIN epitopes is generally used, but a competitive binding assay may also be 

25 employed (Pound, supra). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess die affinity of antibodies for PKIN. Affinity is expressed as an association 
constant, K^, which is defined as the molar concentration of PKIN-antibody complex divided by the 
molar concentrations of free antigen and free antibody under equiUbrium conditions. The K^ 

30 determined for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for 
multiple PKIN epitopes, represents the average affinity, or avidity, of the antibodies for PKIN. The 
K. determined for a preparation of monoclonal antibodies, which are monospecific for a particular 
PKIN epitope, represents a true measure of affinity. High-affinity antibody preparations with K« 
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ranging from about 10^ to 10^ L/mole are preferred for use in inununoassays in which the PKIN- 
antibody complex must widistand rigorous manipulations. Low-affinity antibody preparations with 
ranging from about 10*^ to 10^ L/mole are preferred for use in inmiunopurification and similar 
procedures which ultimately require dissociation of PKIN, preferably in active form, from the antibody 
5 (Catty, D, (1988) Antibodies. Volume I: A Practical Approach . IRL Press, Washington DC; LiddeU, 
J.E. and A. Cryer (1991) A Practical Guide to Monoclonal Antibodies . John Wiley & Sons, New York 
NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example, a 
10 polyclonal antibody preparation containing at least 1-2 mg specific antibodyAnl, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of PKIN-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available. (See, e.g.. Catty, supra , and 
Coligan et al. supra .) 

15 In another embodiment of die invention, the polynucleotides encoding PKIN, or any fragment 

or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, 
RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding 
PKIN. Such technology is well known in the art, and andsense oligonucleotides or larger fragments 

20 can be designed from various locations along the coding or control regions of sequences encoding 
PKIN. (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics . Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 

25 complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g.. 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g.. Miller, A.D. (1990) Blood 
76:271; Ausubel, supra : Uckert, W. and W. Waltiier (1994) Pharmacol. Ther. 63(3):323-347.) Other 

30 gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in die art. (See, e.g., Rossi, J.J. (1995) Br. Med. BuU. 51(l):217-225; Boado, R.J. et 
al. .(1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M.C et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) 
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In another embodiment of the invention, polynucleotides encoding PKIN may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-Xl disease characterized by X- 
linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
S immunodeficiency syndrome associated widi an inherited adenosine deaminase (ADA) deficiency 
(Blaese. R.M. et al. (1995) Science 270:475-480; Bordignon, C. et aL (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et ai (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor Vm or Factor DC deficiencies (Crystal, 

10 R.G. (1995) Science 270:404-410; Verma, LM. and N. Somia (1997) Nature 389:239-242)), (u) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; Poeschla, E. et al. (1996) Proc. NatL Acad. Sci. USA. 93:11395-11399), 

15 hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis : and protozoan parasites such as Plasmodium falcipamm and Trvpanosoma cruzi) . In the 
case where a genetic deficiency in PKIN expression or regulation causes disease, the expression of 
PKIN from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

20 In a further embodiment of the invention, diseases or disorders caused by deficiencies in 

PKIN are treated by constructing manmialian expression vectors encodiog PKIN and introducing 
these vectors by mechanical means into PKIN-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic 
gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and 

25 (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biochem. 
62:191-217; Ivies, Z. (1997) CeU 91:501-510; Boulay, J-L. and R Recipon (1998) Curr. Opin. 
Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of PKIN include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-T0P0TA vectors 

30 (Invitrogen, Carlsbad CA), PCMV-SCBOPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA); 
and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Qontech, Palo Alto CA). PKIN 
may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous 
sarcoma virus (RSV). SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promoter 
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(e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. 
USA 89:5547-5551; Gossen, M. et aL (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau 
(1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); 
the ecdysone-inducible promoter (available in the plasmids'PVGRXR and PIND; Invitrogen); the 

5 FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M. V. 
and Blau, UM. supra) ), or (iii) a tissue-specific promoter or the native promoter of flie endogenous 
gene encoding PKIN from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSEECnON KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 

10 polynucleotides to target cells in culture and require nnnimal effott to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, RL. and AJ. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 

15 In another embodiment of flie invention, diseases or disorders caused by genetic defects with 

respect to PKIN expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding PKIN under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus ci5-actmg RNA sequences and coding sequences 

20 required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 

commercially available (Stratagene) and are based on published data (Riviere, 1. et al. (1995) Proc. 
Natt. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated m 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a prondscuous envelope protein such as VS Vg (Armentano, D. et al. 

25 (1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miner (1988) J. Virol. 62:3802-3806; DuU, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. et 
al. (1998) J. ViroL 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg C*Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant'') discloses 
a method for obtaining retrovims packaging cell lines and is hereby incorporated by reference. 

30 Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T-cells), and the 
return of transduced cells to a patient are procedures well known to persons skilled in the art of gene 
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et 
al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. ViroL 71:4707-4716; Ranga, U. et al. (1998) 
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Proc. Natl. Acad. Sci. USA 95:1201-1206; Su. L. (1997) Blood 89:2283-2290). 

In the alternative, an adeBovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding PKIN to cells which have one or more genetic abnormalities with respect to 
the expression of PKIN. The construction and packaging of adenovims-based vectors are well 
S known to those with ordinary skill in the art Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete» M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene therapy**), hereby 
incorporated by reference. For adenoviral vectors, see also Antinozzi, P. A. et al. (1999) Annu. Rev. 
10 Nutr. 19:511-544 and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242, both incorporated by 
reference herein. 

In another alternative, a herpes-based, gene therapy deUvery system is used to deUver 
polynucleotides encoding PKIN to target cells which have one or more genetic abnormalities with 
respect to the expression of PKIN. The use of herpes simplex virus (HS V)-based vectors may be 

15 especially valuable for introducing PKIN to cells of the central nervous system, for which HSV has a 
tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res. 
169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 

20 Patent No. 5,804,413 to DeLuca ('Herpes simplex virus strains for gene transfer"), which is hereby 
incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV d92 
which consists of a genome containing at least one exogenous gene to be transferred to a cell under 
the control of the appropriate promoter for purposes including human gene therapy. Also taught by 
this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and 

25 ICP22. For HSV vectors, see also Coins, W.F. et aL (1999) J. ViroL 73:519-532 and Xu, H. et aL 
(1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned 
herpesvirus sequences, the generation of recombinant viros following the transfecdon of multiple 
plasmids containing different segments of the large herpesvirus genomes, the growth and propagation 
of herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of 

30 ordinary skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding PKIN to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
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on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. BiotechnoL 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated diat normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the fiiU length genomic RNA, 
resulting m the overproduction of capsid proteins relative to die viral proteins with enzymatic activity 
5 (e.g., protease and polymerase). Sunilarly, inserting the coding sequence for PKIN into the alphavirus 
genome in place of die capsid-coding region results in the production of a large number of PKIN- 
coding RNAs and die synthesis of high levels of PKIN in vector transduced cells. While alphavirus 
infection is typically associated wifli cell lysis widiin a few days, the ability to establish a persistent 
mfection in hamster normal kidney cells (BHK-21) widi a variant of Sindbis virus (SIN) indicates fliat 
10 the lytic replication of alphaviruses can be altered to suit die needs of die gene therapy application 
(Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will aflow the 
introduction of PKIN into a variety of cell types. The specific transduction of a subset of cells m a 
population may require flie sorting of cells prior to transduction. The mediods of manipulating 
infectious cDNA clones of alphaviruses, performmg alphavims cDNA and RNA transfections, and 
15 performing alphavirus infections, are well known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can 
be achieved using triple helix base-pairing mediodology. Triple helix pairing is useful because it causes 
inhibition of die ability of die double helix to open sufficiently for die binding of polymerases, 
20 transcription factors, or regulatory molecules. Recent dierapeutic advances using triplex DNA have 
been described in die literature. (See, e.g.. Gee, J.E. et aL (1994) in Huber, B.E. and B.I. Carr, 
Molecular and Immunologic Approaches , Futura Publishing, Mt Kisco NY, pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translation of mRNA 
by preventing the transcript from binding to ribosomes. 
25 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 

RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding PKIN. 
30 Specific ribozyme cleavage sites within any potential RNA target are initially identified by 

scanning the target molecule for ribozyme cleavage sites, inchiding the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing die cleavage site, may be evaluated for 
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secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 

S by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding PKIN. Such DNA sequences may be incorporated into a wide variety of vectors 
with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs 

10 that synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, 
or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends 
of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages 

15 within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, 
and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recogoized by endogenous endonucleases. 
An additional embodiment of the invention encompasses a method for screening for a 

20 compound which is effective in altering expression of a polynucleotide encoding PKIN. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 
chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 

25 compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased PKIN 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding PKIN may be therapeutically useful, and in the treatment of disorders associated with 
decreased PKIN expression or activity, a compound which specifically promotes expression of the 

30 polynucleotide encoding PKIN may be therapeutically usefiil. 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
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altering polynucleotide e3q)ression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selectionirom a 
Ubrary of chemical compounds created combinatorially or randomly. A sample comprising a 

S polynucleotide encoding PKIN is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in die expression of a polynucleotide encoding PKIN are assayed by 
any method commonly known in the art. Typically, the expression of a specific nucleotide is detected 
by hybridization wifli a probe having a nucleotide sequence complementary to the sequence of the 

10 polynucleotide encoding PKIN. The amount of hybridization may be quantified, thus forming the 
basis for a comparison of the expression of the polynucleotide both with and without exposure to one 
or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a 
test compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in altering e^ipression of a specific polynucleotide 

15 can be carried out, for example, using a SnhimRaccharomvces pombe gene expression system (Atkins, 
D. et al. (1999) U.S. Patent No. 5,932,435; Amdt, G.M. et al. (2000) Nucleic Acids Res. 28:E15) or a 
human cell line such as HeLa cell (Clarke, M.L. et aL (2000) Biochem. Biophys. Res. Commun. 
268:8-13). A particular embodiment of the present invention involves screening a combinatorial library 
of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and modified 

20 oligonucleotides) for antisense activity against a specific polynucleotide sequence (Bruice, T.W. et al. 
(1997) U.S. Patent No. 5.686,242; Bruice, T.W. et aL (2000) U.S. Patent No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient. 

25 Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
BiotechnoL 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 

30 monkeys. 

An additional embodiment of the invention relates to the administration of a composition which 
generally comprises an active ingredient formulated with a pharmaceutically acceptable excipient 
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 
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formulations are commonly known and are thoroughly discussed in the latest edition of Remineton'is 
Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may consist of PKIN, 
antibodies to PKIN» and mimetics, agonists, antagonists, or inhibitors of PKIN. 

The compositions utilized in this invention may be administered by any number of routes 
S including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form. 
These compositions are generally aerosolized inmiediately prior to inhalation by the patient. In the 
10 case of small molecules (e.g. traditional low molecular weiglht organic drugs), aerosol delivery of fast- 
acting formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides and 
proteins), recent developments in the field of pulmonary delivery via the alveolar region of the hing 
have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S. 
et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without 
IS needle injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination 
of an effective dose is weU within the capability of those skilled in the art 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
20 macromolecules comprising PKIN or fragments thereof. For example, liposome preparations 

containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the 
macromolecule. Alternatively, PKIN or a fragment thereof may be joined to a short cationic N- 
terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
25 al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
or pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes for 
30 administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example PKIN 
or fragments thereof, antibodies of PKIN, and agonists, antagonists or inhibitors of PKIN, which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
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standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (die dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the ID^JED^q ratio. Compositions which exhibit large 

5 therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentrations that includes the ED50 with little or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the 
patient, and the route of administration. 

10 The exact dosage will be determined by the practitioner, in ligiht of factors related to die 

subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include die 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and response 

15 to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 fig to 100,000 up to a total dose of 
about 1 gram, depending upon die route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art 

20 Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind PKIN may be used for the 
25 diagnosis of disorders characterized by expression of PKIN, or in assays to monitor patients being 
treated with PKIN or agonists, antagonists, or inhibitors of PKIN. Antibodies useful for diagnostic 
purposes may be prepared in die same manner as described above for therapeutics. Diagnostic 
assays for PKIN include methods which utilize the antibody and a label to detect PIGN in human body 
fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, and 
30 may be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring PKIN, including ELJSAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of PKIN expression. Normal 
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or standard values for PKIN expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, for example, human subjects, with antibodies to PKIN under 
conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of PKIN expressed in subject, 

S control, and disease samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes die parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding PKIN may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 

10 and quantify gene expression in biopsied tissues in which expression of PKIN may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
PKIN, and to monitor regulation of PKIN levels during therapeutic intervention. 

In one aspect, hybridization with PGR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding PKIN or closely related molecules may be used to 

15 identify nucleic acid sequences which encode PKIN. The specificity of the probe, whether it is made 
from a highly specific region, e.g., the 5 ' regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding PKIN, allelic variants, or related 
sequences. 

20 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to any of the PKIN encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:23-44 or from 
genomic sequences including promoters, enhancers, and introns of the PKIN gene. 

Means for producing specific hybridization probes for DNAs encoding PKIN include the 

25 cloning of polynucleotide sequences encoding PKIN or PKIN derivatives into vectors for the 

production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes maybe labeled by a 
variety of reporter groups, for example, by radionuclides such as or ^^S, or by enzymatic labels, 

30 such as alkaline phosphatase coupled to the probe via avidin/biodn coupling systems, and the Uke. 

Polynucleotide sequences encoding PKIN may be used for the diagnosis of disorders 
associated with expression of PKIN. Examples of such disorders include, but are not limited to, a 
cancer, such as adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, 
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teratocarcinoma, aad, in particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, 
breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, 
pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus, 
leukemias such as multiple myeloma and lymphomas such as Hodgkin's disease; an inunune disorder, 
5 such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress 
syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asfhma, atfierosclerosis, autoinunune 
hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal 
dystrophy (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, 
dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, 

10 erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's 

syndrome, gout. Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, 
multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, 
osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 

IS thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 

hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic 
infections, and trauma; a growfli and developmental disorder, such as actinic keratosis, arteriosclerosis, 
atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, 
paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and 

20 cancers including adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, 

teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, 
breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, 
pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus, 
renal tubular acidosis, anemia, Cushing's syndrome, achondroplastic dwarfism, Duchenne and Becker 

25 muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome (Wilms' tumor, aniridia, 
genitourinary abnormalities, and mental retardation), Smith-Magenis syndrome, myelodysplastic 
syndrome, hereditary mucoepithelial dysplasia, hereditary keratodermas, hereditary neuropathies such 
as Charcot-Marie-Tooth disease and neurofibromatosis, hypothyroidism, hydrocephalus, seizure 
disorders such as Syndenham's chorea and cerebral palsy, spina bifida, anencephaly, 

30 craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing loss; a cardiovascular 
disease, such as arteriovenous fistula, atherosclerosis, hypertension, vasculitis, Raynaud's disease, 
aneurysms, arterial dissections, varicose veins, thrombophlebitis and phlebothrombosis, vascular 
tumors, and complications of thrombolysis, balloon angioplasty, vascular replacement, and coronary 
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artery bypass graft surgery* congestive heart failure* ischemic heart disease, angina pectoris, 
myocardial infarction, hypertensive heart disease, degenerative valvular heart disease, calcific aortic 
valve stenosis, congenitally bicuspid aortic valve, mitral annular calcification, mitral valve prolapse, 
rheumatic fever and rheumatic heart disease, infective endocarditis, nonbacterial thrombotic 
5 endocarditis, endocarditis of systemic lupus erythematosus, carcinoid heart disease, cardiomyopathy, 
myocarditis, pericarditis, neoplastic heart disease, congenital heart disease, and complications of 
cardiac transplantation, congenital lung anomalies, atelectasis, pulmonary congestion and edema, 
pulmonary embolism, pulmonary hemorrhage, pulmonary infarction, pulmonary hypertension, vascular 
sclerosis, obstructive pulmonary disease, restrictive pulmonary disease, dironic obstroctive pulmonary 

10 disease, emphysema, chronic bronchitis, bronchial asthma, bronchiectasis, bacterial pneumonia, viral 
and mycoplasmal pneumonia, lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, 
pneumoconioses, sarcoidosis, idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, 
hypersensitivity pneumonitis, pulmonary eosinophilia bronchiolitis obliterans-organizing pneumonia, 
diffuse pulmonary hemorrhage syndromes, Goodpasture's syndromes, idiopathic pulmonary 

15 hemosiderosis, pulmonary involvement in collagen-vascular disorders, pulmonary alveolar proteinosis, 
hing tumors, inflammatory and noninflammatory pleural effusions, pneumothorax, pleural tumors, drug- 
induced lung disease, radiation-induced hing disease, and complications of lung transplantation; and a 
lipid disorder such as fatty Uver, cholestasis, primary biliary cinfaosis, carnitine deficiency, carnitine 
palmitoyltransferase deficiency, myoadenylate deaminase deficiency, hypertriglyceridemia, lipid 

20 storage disorders such Fabry's disease, Gaucher* s disease, Niemann-Rck's disease, metachromatic 
leukodystrophy, adrenoleukodystrophy, GMj gangliosidosis, and ceroid lipofuscinosis, 
abetalipoproteinemia, Tangier disease, hyperlipoproteinemia, diabetes mellitus, lipodystrophy, 
lipomatoses, acute panniculitis, disseminated fat necrosis, adiposis dolorosa, lipoid adrenal hyperplasia, 
minima] change disease, lipomas, atherosclerosis, hypercholesterolemia, hypercholesterolemia with 

25 hypertriglyceridemia, primary hypoalphalipoproteinemia, hypothyroidism, renal disease, liver disease, 
lecithinrcholesterol acyltransferase deficiency, cerebrotendinous xanthomatosis, sitosterolemia, 
hypocholesterolemia, Tay-Sachs disease, Sandhoffs disease, hyperlipidemia, hyperlipemia, lipid 
myopathies, and obesity. The polynucleotide sequences encoding PKIN may be used in Southern or 
nortfaem analysis, dot blot, or other membrane-based technologies; in PGR technologies; in dipstick, 

30 pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to 
detect altered PKIN expression. Such qualitative or quantitative methods are well known in the art. 
In a particular aspect, the nucleotide sequences encoding PKIN may be usefiil in assays that 

m 

detect the presetice of associated disorders, particularly those mentioned above. The nucleotide 
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sequences encoding PKIN may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for tfie formation of hybridization complexes. After a suitable 
incubation period, the sample is washed and the signal is quantified and compared widi a standard 
value. If the amount of signal in the patient sample is significantly altered in comparison to a control 

S sample then the presence of altered levels of nucleotide sequences encoding PKIN in the sample 
indicates the presence of the associated disorder. Such assays may also be used to evaluate the 
efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor 
the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of PKIN, 

10 a normal or standard profile for expression is established. This may be accomplished by combining 
body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a 
fragment thereof, encoding PKIN, under conditions suitable for hybridization or amplification. 
Standard hybridization may be quantified by comparing the values obtained from normal subjects with 
values from an experiment in which a known amount of a substantially purified polynucleotide is used. 

15 Standard values obtained in this manner may be compared widi vabes obtained from samples from 
patients who are symptomatic for a disorder. Deviation from standard values is used to establish the 
presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 

20 patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 

25 of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier diereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed fi'om the sequences encoding PKIN 

30 may involve the use of PGR. These oligomers may be chemically synthesized, generated 

enzymatically, or produced in vitro . Oligomers will preferably contain a fi'agment of a polynucleotide 
encoding PKIN, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
PKIN, and will be employed under optimized conditions for identification of a specific gene or 
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condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from die polynucleotide sequences 
encoding PKIN may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
S substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease 
in humans. Methods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
derived from the polynucleotide sequences encoding PKIN are used to amplify DNA using the 
polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal 

10 tissue, biopsy samples, bodily fluids, and the Uke. SNPs in the DNA cause differences in the 

secondary and terdary structures of PCR products in single-stranded form, and diese differences are 
detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primers are 
fhiorescently labeled, which allows detection of the amplimers in hig|h-throughput equipment such as 
DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP 

15 (isSNP), are capable of identifying polymorphisms by comparing tfie sequence of individual 

overlapping DNA fragments which assemble into a common consensus sequence. These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical models and automated analyses of DNA sequence chromatograms. In the 
alternative, SNPs may be detected and characterized by mass spectrometry using, for example, die 

20 high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of PKIN include radiolabeling or 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples maybe 

25 accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The microarray 
30 can be used in transcript imaging techniques which monitor the relative expression levels of large 
numbers of genes simultaneously as described below. The microarray may also be used to identify 
genetic variants, mutations, and polymorphisms. This information may be used to determine gene 
function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
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progression/regression of disease as a fiiiiction of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective 
treatment regimen for that patient. For example, therapeutic agents which are highly effective and 
S display die fewest side effects may be selected for a patient based on his/her pharmacogenomic 
profile. 

In another embodiment, PKIN, fragments of PKIN, or antibodies specific for PKIN may be 
used as elements on a microarray. The microarray may be used to monitor or measure protein-protein 
interactions, drug-target interactions, and gene expression profiles, as described above. 

10 A particular embodiment relates to the use of the polynucleotides of the present invention to 

generate a transcript image of a tissue or cell type. A transcript image represents flie global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at 
a given time. (See Seilhamer et al., Xomparative Gene Transcript Analysis," U.S. Patent No. 

15 5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-diroughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 

20 resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene expression in vivo , as in the 
case of a tissue or biopsy sample, or in vitro , as in die case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present invention 

2S may also be used in conjunction with in vitro model systems and preclinical evaluation of 

pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity 
(Nuwaysir, E.F. et al. (1999) MoL Carcinog. 24:153-159; Steiner, S. and N.L. Anderson (2000) 

30 Toxicol. Lett 1 12-1 13:467-471, expressly incorporated by reference herein). If a test compound has a 
signature similar to that of a compound with known toxicity, it is likely to share those toxic properties. 
These fingerprints or signatures are most usefiil and refined when they contain egression information 
from a large number of genes and gene families. Ideally, a genome- wide measurement of expression 
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provides the highest quality signature. Even genes whose expression is not altered by any tested 
compounds are important as weU, as the levels of expression of these genes are used to normalize the 
rest of the expression data. The normalization procedure is useful for comparison of expression data 
after treatment widi different compounds. While the assignment of gene function to elements of a 
S toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not 
necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for 
example, Press Release 00-02 from the National Institute of Environmental Health Sciences, released 
February 29, 2000, available at http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is 
important and desirable in toxicological screening using toxicant signatures to include all expressed 
10 gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the present 
invention, so that transcript levels corresponding to the polynucleotides of the present invention may be 

15 quantified. The transcript levels in the treated biological sample are compared wiQi levels in an 
untreated biological sample. Differences in the transcript levels between the two samples are 
indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of the polypeptide sequences of the present 
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global 

20 pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome 
can be subjected individually to further analysis. Proteome expression patterns, or profiles, are 
analyzed by quantifying the number of expressed proteins and their relative abundance under given 
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating 
and analyzing the polypeptides of a particular tissue or cell ^e. In one embodiment, the separation is 

25 achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by 
isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra) . Hie proteins 
are visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an 
agent such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot 

30 is generally proportional to the level of the protein in the sample. The optical densities of equivalently 
positioned protein spots from different samples, for example, from biological san^les either treated or 
untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment. The proteins in the spots are partially sequenced using, for 
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example, standard methods employing chemical or enzymatic cleavage followed by mass 
spectrometry. The identity of the protein in a spot may be detemiined by comparing its partial 
sequence, preferably of at least 5 contiguous amino acid residues, to die polypeptide sequences of the 
present invention. In some cases, further sequence data may be obtained for definitive protein 
S identification. 

A proteomic profile may also be generated using antibodies specific for PKIN to quantify the 
levels of PKIN expression. In one embodiment, the antibodies are used as elements on a microarray, 
and protein expression levels are quantified by exposing the microarray to the sample and detecting 
the levels of protein bound to each array element (Lueking, A. et aL (1999) AnaL BiocheuL 270:103- 
10 111; Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a 

variety of methods known in the art, for example, by reacting the proteins in the sample with a thiol- or 
amino-reactive fluorescent compound and detecting the amount of fluorescence bound at each array 
element. 

Toxicant signatures at die proteome level are also useful for toxicological screening, and 
IS should be analyzed in parallel with toxicant signatures at the transcript level There is a poor 

correlation between transcript and protein abundances for some proteins in some tissues (Anderson,. 
N.L. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures maybe 
useful in the analysis of compounds which do not significantly affect the transcript image, but which 
alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
20 degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated 
biological sample are separated so that the amount of each protein can be quantified. The amount of 
each protein is compared to the amount of the corresponding protein in an untreated biological sample. 
25 A difference in the amount of protein between the two samples is indicative of a toxic response to the 
test compound in the treated sample. Individual proteins are identified by sequencing the amino acid 
residues of the individual proteins and comparing these partial sequences to the polypeptides of the 
present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
30 sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the polypeptides of the present invention. The amount of protein recognized 
by the antibodies is quantified. The amount of protein in the treated biological sample is compared 
with the amount in an untreated biological sample. A difference in die amount of protein between the 
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two samples is indicative of a toxic response to the test compound in the treated sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art (See, e.g., 
Brennan, T.M. et al, (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. 
Sci. USA 93:10614-10619; Baldeschweiler et al. (1995) PCT appUcation W095/251116; Shalon, D. et 

5 al. (1995) PCT appUcation WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 
94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of 
microarrays are well known and thoroughly described in DNA Microarravs: A Practical Approach , 
M. Schena, ed. (1999) Oxford University Press, London, hereby expressly incorporated by reference. 
In another embodiment of the invention, nucleic acid sequences encoding PKIN may be used 

10 to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 

15 chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (Y ACs), bacterial artificial chromosomes (B ACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. 
Genet 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet 
7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop 

20 genetic linkage maps, for example, which correlate the inheritance of a disease state with the 

inheritance of a particular chromosome region or restriction fi^agment length polymorphism (RFLP). 
(See, for example, Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

Fluorescent in situ hybridization (HSH) may be correlated with other physical and genetic 
map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic 

25 map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. Correlation between the location of the gene encoding PKIN on a 
physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder and thus may fiirther positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 

30 linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammaUan species, such as mouse, 
may reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
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techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 
localized by genetic linkage to a particular genomic region, e.g.» ataxia-telangiectasia to 1 lq22-23, any 
sequences mapping to that area may represent associated or regulatory genes for further investigation. 
(See, e.g., Gatti, R.A. et aL (1988) Nature 336:577-580.) The nucleotide sequence of the instant 

5 invention may also be used to detect differences in the chromosomal location due to translocation, 
inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, PKIN, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 

10 solid support, borne on a cell surface, or located intracellularly. Hie formation of binding complexes 
between PKIN and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et aL (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 

15 synfliesized on a solid substrate. The test compounds are reacted with PKIN, or fragments thereof, 
and washed. Bound PKIN is then detected by methods well known in the art. Purified PKIN can 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support. 

20 In another embodiment, one may use competitive drug screening assays in which neutralizing 

antibodies capable of binding PKIN specificaUy compete with a test compound for binding PKIN. In 

this manner, antibodies can be used to detect the presence of any peptide which shares one or more 

antigenic determinants with PKIN. 

In additional embodiments, the nucleotide sequences which encode PKIN may be used in any 
25 molecular biology techniques that have yet to be developed, provided the new techniques rely on 

properties of nucleotide sequences that are currently known, including, but not limited to, such 

properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 

description, utihze the present invention to its fullest extent. The following embodiments are, therefore, 
30 to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way 

whatsoever. 

Widiout further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific 
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embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, in 
particular U.S. Ser. No. 60/242,410, U.S. Ser. No. 60/244,068. U.S. Ser. No. 60/245,708. U.S. Ser. 
5 No. 60/247,672, U.S. Ser. No. 60/249,565, U.S. Ser. No. 60/252,730, and U.S. Ser. No. 60/250,807, 
are hereby expressly incorporated by reference. 

EXAMPLES 

I. Construction of cDNA Libraries 

10 Incyte cDNAs were derived from cDNA libraries described m the LIFESEQ GOLD 

database (Incyte Genomics, Palo Alto CA) and shown in Table 4, column 5. Some tissues were 
homogenized and lysed in guanidinium isotfaiocyanate, while others were homogenized and lysed in 
phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic 
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl 

15 cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol 
or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was 
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OUGOTEX latex particles 

20 (QIAGEN, Chatsworth CA), or an OUGOTEX mRNA purification kit (QIAGEN). Alternatively, 
RNA was isolated directiy from tissue lysates using other RNA isolation kits, e.g., the 
POLY( A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constmcted with the 

« 

25 UNEAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using 
the recommended procedures or similar methods known in the art (See, e.g., Ausubel, 1997, supra , 
units 5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300- 

30 1000 bp) using SEPHACRYL S 1000, SEPHAROSE CL2B, or SEPHAROSE CUB column 

chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRIPT plasmid (Stratagene), PSPORTl plasmid (Life Technologies), PCDNA2.1 plasmid 
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(Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene). PCR2-T0P0TA plasmid (Invitrogen), 
PCMV-ICIS plasmid (Stratagene), pIGEN (Incyte Genomics, Palo Ako CA), or pINCY (Incyte 
Genomics), or derivatives thereof. Recombinant plasmids were transformed into competent E. coli 
cells including XLl-Bhie, XLl-BhieMRF, or SOLR from Stratagene or DH5a, DHIOB, or 
5 ElectroMAX DHIOB from life Technologies. 

n. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo 
excision using the UNEAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 

10 AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Phis Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 
96 plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 
nol of distilled water and stored, witfi or without lyophilization, at 4''C. 

Alternatively, plasmid DNA was amplified fi^m host cell lysates using direct link PCR in a 

15 high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction nocture. Samples were processed and stored in 
384-well plates, and the concentration of amplified plasmid DNA was quantified fhiorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN n fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 

20 IIL Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example n were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instramentation such 
as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler 
(MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 

2S MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Phannacia Biotech or supplied in ABI sequencing kits such as 
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the 

30 ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art Reading 
frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 
1997, supra , unit 7.7). Some of the cDNA sequences were selected for extension using the 
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techniques disclosed in Example Vm. 

The polynucleotide sequences derived from Incyte cDNAs were validated by removing 
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 
programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The 
S Incyte cDNA sequences or translations thereof were then queried against a selection of public 

databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and 
BLOCKS, PRINTS, DOMO, PRODOM, and hidden Markov model (HMM)-based protein family 
databases such as PFAM. (HMM is a probabilistic approach which analyzes consensus primary 
structures of gene families. See, for example, Eddy, S.R. (1996) Curr. Opin. Struct BioL 6:361-365.) 

10 The queries were performed using programs based on BLAST, FAST A, BLIMPS, and HMMER. 
The Incyte cDNA sequences were assembled to produce lull length polynucleotide sequences. 
Alternatively, GenBank cDNAs, GenBank ESTs, stitched sequences, stretched sequences, or 
Genscan-predicted coding sequences (see Examples IV and V) were used to extend Incyte cDNA 
assemblages to iuU length. Assembly was performed using programs based on Fhred, Phrap, and 

IS Consed, and cDNA assemblages were screened for open reading frames using programs based on 
GeneMark, BLAST, and PASTA. The full length polynucleotide sequences were translated to derive 
the corresponding fiiU length polypeptide sequences. Alternatively, a polypeptide of the invention may 
begin at any of the methionine residues of the full length translated polypeptide. Full length polypeptide 
sequences were subsequently analyzed by querying against databases such as the GenBank protein 

20 databases (genpept), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, and hidden 

Markov model (HMM)-based protein family databases such as PFAM. Full length polynucleotide 
sequences are also analyzed using MACDNASIS PRO software (Hitachi Software Engineering, 
South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and polypeptide 
sequence alignments are generated using default parameters specified by the CLUSTAL algorithm as 

25 incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which also 
calculates the percent identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
Incyte cDNA and ftill length sequences and provides applicable descriptions, references, and threshold 
parameters. The first column of Table 7 shows the tools, programs, and algorithms used, the second 

30 column provides brief descriptions diereof, the third cohiimi presents appropriate references, all of 
which are incorporated by reference herein in their entirety, and the fourth column presents, where 
applicable, the scores, probability values, and other parameters used to evaluate the strength of a 
match between two sequences (the higjher the score or the lower the probability value, the greater the 
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identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide and 
polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:23-44. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
S amplification technologies are described in Table 4, column 4. 

IV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative human kinases were initially identified by running the Genscan gene identification 
program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan is a general- 
purpose gene identification program which analyzes genomic DNA sequences from a variety of 

10 organisms (See Burge, C. and S. Karlin (1997) J. Mol. BioL 268:78-94, and Burge, C. and S. Karlin 
(1998) Curr. Opin. Stroct. Biol. 8:346-354). The program concatenates predicted exons to form an 
assembled cDNA sequence extending from a methionine to a stop codon. The output of Genscan is a 
FASTA database of polynucleotide and polypeptide sequences. The maximum range of sequence for 
Genscan to analyze at once was set to 30 kb. To determine which of these Genscan predicted cDNA 

IS sequences encode human kinases, the encoded polypeptides were analyzed by querying against 
PFAM models for human kinases. Potential human kinases were also identified by homology to 
Incyte cDNA sequences that had been annotated as human kinases. These selected Genscan- 
predicted sequences were then compared by BLAST analysis to the genpept and gbpri public 
databases. Where necessary, the (jenscan-predicted sequences were then edited by comparison to 

20 the top BLAST hit from genpept to correct errors in the sequence predicted by Genscan, such as 
extra or omitted exons. BLAST analysis was also used to find any Incyte cDNA or public cDNA 
coverage of the Genscan-predicted sequences, thus providing evidence for transcription. When Incyte 
cDNA coverage was available, diis information was used to correct or confirm the Genscan predicted 
sequence. FuU length polynucleotide sequences were obtained by assembling Genscan-predicted 

25 coding sequences with Incyte cDNA sequences and/or public cDNA sequences using the assembly 
process described in Example m. Alternatively, fiill length polynucleotide sequences were derived 
entirely from edited or unedited Genscan-predicted coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
*^Stitched" Sequences 

30 Partial cDNA sequences were extended with exons predict^ by the Genscan gene 

identification program described in Example IV. Partial cDNAs assembled as described in Example 
ni were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an algorithm 
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based on graph theory and dynamic programming to integrate cDNA and genomic information, 
generating possible splice variants that were subsequently confirmed, edited, or extended to create a 
full length sequence. Sequence intervals in which the entire length of the interval was present on 
more than one sequence in the cluster were identified, and intervals thus identified were considered to 

S be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic 
sequences, then all three intervals were considered to be equivalent. This process allows unrelated 
but consecutive genomic sequences to be brought together, bridged by cDNA sequence. Intervals 
thus identified were then ''stitched'' together by the stitching algorithm In the order that they appear 
along their parent sequences to generate the longest possible sequence, as well as sequence variants. 

10 Linkages between intervals which proceed along one type of parent sequence (cDNA to cDNA or 
genomic sequence to genomic sequence) were given preference over linkages which change parent 
type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared 
by BLAST analysis to the genpept and gbpri public databases. Incorrect exons predicted by Genscan 
were corrected by comparison to the top BLAST hit fi-om genpept Sequences were fiirther extended 

15 with additional cDNA sequences, or by inspection of genomic DNA, when necessary. 

"Stretched^^ Sequences 

Partial DNA sequences were extended to hill length with an algorithm based on BLAST 
analysis. First, partial cDNAs assembled as described in Example m were queried agamst public 
databases such as the GenBank primate, rodent, manunalian, vertebrate, and eukaryote databases 

20 using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST 
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 
Example IV. A chimeric protein was generated by using the resultant higih-scoring segment pairs 
(HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions 
may occur in the chimeric protein with respect to the original GenBank protein homolog. The 

25 GenBank protein homolog, the chimeric protein, or both were used as probes to search for homologous 
genomic sequences fi:om the public human genome databases. Partial DNA sequences were 
therefore ''stretched*' or extended by the addition of homologous genomic sequences. The resultant 
stretched sequences were examined to determine whether it contained a complete gene. 
VI. Chromosomal Mapping of PKIN Encoding Polynucleotides 

30 The sequences which were used to assemble SEQ ID NO:23-44 were compared with 

sequences from the Incyte LDFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:23-44 were assembled into clusters of contiguous and overlapping sequences using 
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assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and G6nethon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 

5 of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 
arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 

10 humans, although this can vary widely due to hot and cold spots of recombination.) The cM distances 
are based on genetic markers mapped by Genethon which provide boundaries for radiation hybrid 
markers whose sequences were included in each of the clusters. Human genome maps and other 
resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site 
(http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified disease 

15 genes map within or in proximity to the intervals indicated above. 

In this manner, SEQ ID NO:29 was mapped to chromosome 1 within tiie interval from 199.20 
to 203.00 centiMorgans, to chromosome 13 within the interval from 105.20 centiMorgans to the q 
terminus, and to chromosome 6 within the interval from 59.60 to 72.20 centiMorgans. More than one 
map location is reported for SEQ ID NO:29, indicating tiiat sequences having diflerent map locations 

20 were assembled into a single cluster. This situation occurs, for example, when sequences having 
strong similarity, but not complete identity, are assembled into a single cluster. 
VII. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 

25 from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel 

(1995) supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or UFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
30 search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of tiie search is the product score, which is defined as: 
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BLAST Score x Percent Identity 
5 X minimum {length(Seq. 1), length(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the 
S length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 

10 gaps). If there is more than one HSPy then the pair with die highest BLAST score is used to calculate 
the product score. The product score represents a balance between fractional overlap and quality in a 
BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 

15 other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
identity and 100% overlap. 

Alternatively, polynucleotide sequences encoding PKIN are analyzed with respect to the 
tissue sources from which they were derived. For example, some full length sequences are 
assembled, at least in part, with overlapping Incyte cDNA sequences (see Example m). Each cDNA 

20 sequence is derived from a cDNA library constructed from a human tissue. Each human tissue is 
classified into one of the following organ/tissue categories: cardiovascular system; connective tissue; 
digestive system; embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, 
male; germ cells; hemic and immune system; liver; musculoskeletal system; nervous system; 
pancreas; respiratory system; sense organs; skin; stomatognathic system; unclassified/mixed; or 

25 urinary tract. The number of libraries in each category is counted and divided by the total number of 
libraries across all categories. Similarly, each human tissue is classified into one of the following 
disease/condition categories: cancer, cell line, developmental, inflammation, neurological, trauma, 
cardiovascular, pooled, and other, and the number of libraries in each category is counted and divided 
by the total number of libraries across all categories. The resulting percentages reflect the tissue- and 

30 disease-specific expression of cDNA encoding PKIN. cDNA sequences and cDNA library/tissue 
information are found in the UFESEQ GOLD database (Ihcyte Genomics, Palo Alto CA). 
VIIL Extension of PKIN Encoding Polynucleotides 

Full length polynucleotide sequences were also produced by extension of an appropriate 
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fragment of the fiiU length molecule using oligonucleotide primers designed from this fragment One 
primer was synthesized to initiate S' extension of the known fragment, and the other primer was 
synthesized to initiate 3' extension of the known fragment. The initial primers were designed using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
5 nucleotides in lengdi, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68 to about 72 ''C. Any stretch of nucleotides which would 
result in hairpin stmctures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

10 High fidelity amplification was obtained by FCR using methods well known in the art PGR 

was performed in 96-weII plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
* mix contained DNA template, 200 imiol of each primer, reaction buffer containing Mg^'^, (NEi4)2S04, 
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE 
enzyme (life Technologies), and Pfii DNA polymerase (Stratagene), with the following parameters 

15 for primer pair PCI A and PCI B: Step 1: 94 ^C, 3 min; Step 2: 94 ^'C, 15 sec; Step 3: 60X, 1 min; 
Step 4: 68 X, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68**C, 5 min; Step 7: storage 
at 4''C. In the alternative, the parameters for primer pair T7 and SK-h were as follows: Step 1: 94 X, 
3 min; Step 2: 94*'C, 15 sec; Step 3: 57''C, 1 min; Step 4: 68X, 2 min; Step 5: Steps 2, 3, and 4 
repeated 20 times; Step 6: 68 ""C, 5 min; Step 7: storage at 4X. 

20 The concentration of DNA in each well was determined by dispensing 100 /il PICOGREEN 

quantitation reagent (0.25% (y/w) PICOGREEN; Molecular Ptobes, Eugene OR) dissolved in IX TE 
and 0.5 ^1 of undiluted PCR product into each well of an opaque fluorimeter plate (Coming Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fhioroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 

25 concentration of DNA. A 5 /il to 10 aliquot of the reaction mixture was analyzed by 

electrophoresis on a 1 % agarose gel to determine which reactions were successful in extending the 
sequence. 

The extended nucleotides were desalted and concentrated, transferred to 3 84- well plates, 
digested with Cvill cholera vims endonuclease (Molecular Biology Research, Madison WI), and 
30 sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended 
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector 
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(Amersham Pharmacia Biotech), treated with Piu DN A polymerase (Stratagene) to fill-in restriction 
site overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37 °C in 384- 
well plates in LB/2x carb liquid media. 

5 The cells were lysed, and DNA was amplified by PGR using Taq DNA polymerase 

(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 
parameters: Step 1: 94 X, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72 °C, 2 min; Step 
S: steps 2, 3, and 4 repeated 29 times; Step 6: 72 ""C, 5 min; Step 7; storage at 4°C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 

10 recoveries were reampUfied using the same conditions as described above. Samples were diluted with 
20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing 
primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 
BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 

In like manner, fiill length polynucleotide sequences are verified using the above procedure or 

IS are used to obtain 5' regulatory sequences using Uie above procedure along with oligonucleotides 
designed for such extension, and an appropriate genomic library. 

IX. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:23*44 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 

20 pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fi'agments. Oligonucleotides are designed using state-of-the-art software such as OUGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 fdCi of 
[y-32pj adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 

25 SEPHADEX G-25 superfine size exclusion dextran bead cohinm (Amersham Pharmacia Biotech). 
An aliquot containing 10^ counts per minute of the labeled probe is used in a typical membrane-based 
hybridization analysis of human genomic DNA digested with one of the following endonucleases: Ase 
I, Bgl n, Eco RI, Pst I, Xba I, or Pvu n (DuPont NEN). 

The DNA fi'om each digest is fi-actionated on a 0.7% agarose gel and transferred to nylon 

30 membranes (Nytran Phis, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40 ''C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
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compared. 

X. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing. See, e.g., Baldeschweiler, supra .), mechanical 

S microspotting technologies, and derivatives thereof. The substrate in each of the aforementioned 
technologies should be uniform and solid with a non-porous surface (Schena (1999), supra) . 
Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a 
procedure analogous to a dot or slot blot may also be used to arrange and link elements to the surface 
of a substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may 

10 be produced using available methods and machines well known to those of ordinary skill in tfie art and 
may contain any appropriate number of elements. (See, e.g., Schena, M. et al. (1995) Science 
270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; MarshaU, A. and J. Hodgson (1998) 
Nat. Biotechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 

15 comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The 
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 

20 fluorescence scatmer is used to detect hybridization at each array element. Alternatively, laser 
desorbdon and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
the microarray may be assessed. In one embodiment, microarray preparation and usage is described 
in detail below. 

25 Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinhim thiocyanate method and 
poly(A)^ RNA is purified using the oligo-(dT) cellulose method. Each poly(A)^ RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/pil oligo-(dT) primer (21mer), IX first 
strand buffer, 0.03 units/^1 RNase inhibitor, 500 tiM dATP, 500 /xM dGTP, 500 /iM dTTP, 40 liU 

30 dCTF, 40 iiM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A)^ RNA with 
GEMBRIGHT kits (Incyte). Specific control poly(A)* RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37° C for 2 hr, each reaction sample (one 
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with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of O.SM sodium hydroxide and 
incubated for 20 minutes at 85^ C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
5 using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100%^ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended 
in 14 /il 5X SSaO.2% SDS. 

Microarrav Preparation 

Sequences of the present invention are used to generate array elements. Each array element 
10 is amplified from bacterial cells containing vectors with cloned cDNA inserts. PGR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PGR from an initial quantity of 1-2 ng to a final quantity greater than S ^g. 
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech). 
Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
15 slides (Coming) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C 
oven. 

20 Array elements are applied to the coated glass substrate using a procedure described in U.S. 

Patent No. 5,807,522, incorporated herein by reference. 1 ^1 of the array element DNA, at an average 
concentration of 100 ng/^1, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. . 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 

25 Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by iiicubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at dO^'C followed by washes in 0.2% 
SDS and distilled water as before. 
Hybridization 

30 Hybridization reactions contain 9 nl of sample mixture consisting of 0.2 fig each of Cy3 and 

Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65"* C for 5 minutes and is aliquoted onto the microarray surface and covered with 
an 1.8 cm^ coverslip. The arrays are transferred to a waterproof chamber having a cavity just sligfhtly 
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larger flian a microscope slide- The chamber is kept at 100% humidity internally by the addition of 140 
III of 5X SSC in a comer of the chamber. The chamber containing the arrays is incubated for about 
6.5 hours at eO^'C. The arrays are washed for 10 mm at 45° C in a first wash buffer (IX SSC, 0.1% 
SDS), three times for 10 minutes each at 45** C in a second wash buffer (O.IX SSC), and dried. 
5 Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral Imes 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 

10 containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past die objective. The 1.8 cm x L8 cm array used m die present example is scanned witfi a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fhiorophores sequentially. 
Emitted light is split, based on wavelengdi, into two photomultiplier tube detectors (PMT R1477, 

15 Hamamatsu Photonics Systems, Bridgewater NJ) conesponding to the two fhiorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fhiorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fhiorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fhiorophores simultaneously. 

20 The sensitivity of the scans is typically calibrated using the signal intensity generated by a 

cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that location 
to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples from 
different sources (e.g., representing test and control ceUs), each labeled with a different fhiorophore, 

25 are hybridized to a single array for the purpose of identifying genes that are differentially expressed, 
the calibration is done by labeling samples of the calibrating cDNA with the two fhiorophores and 
adding identical amounts of each to the hybridization mixture. 

The output of flie photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 

30 computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale rangmg from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fhiorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission 
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spectra) between the fluorophores using each fhiorophore's emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 
S for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

XI. Complementary Polynucleotides 

Sequences complementary to the PKIN-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring PKIN. Although use of oligonucleotides 
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with 

10 smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
4.06 software (National Biosciences) and the coding sequence of PKIN. To inhibit transcription, a 
complementary oligonucleotide is designed from the most unique sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the PKIN-encoding transcript. 

15 XII. Expression of PKIN 

Expression and purification of PKIN is achieved using bacterial or viros-based expression 
systems. For expression of PKIN in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac {tac) hybrid 

20 promoter and the TS or T7 bacteriophage promoter in conjunction with die lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express PKIN upon induction with isopropyl beta-D-thiogalactopyranoside 
(IPTG). Expression of PKIN in eukaryotic cells is achieved by infecting insect or mammalian cell 
lines with recombinant Autographica califomica nuclear polyhedrosis vims (AcMNPV), commonly 

25 known as baculo virus. The nonessential polyfaedrin gene of baculovims is replaced with cDNA 
encoding PKIN by either homologous recombination or bacterial-mediated transposition involving 
transfer plasmid intermediates. Viral infectivity is maintained and the strong pol)iiedrin promoter 
drives high levels of cDNA transcription. Recombinant baculo virus is used to infect S podoptera 
frugjperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. Infection of the 

30 latter requires additional genetic modifications to baculovims. (See Engelhard, E.K. et aL (1994) Proc. 
Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.) 

In most expression systems, PKIN is synthesized as a fusion protein with, e.g., glutathione S* 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
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affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma iapomcum , enables the purification of fusion proteins on iomiobilized 
ghitadiione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from PKIN at 

S specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoafiinity purification 
using conmiercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra , 
ch. 10 and 16). Purified PKIN obtained by these methods can be used directly in the assays shown in 

10 Examples XVI, XVn, and XVin, where applicable. 

XIII. Functional Assays 

PKIN function is assessed by expressing the sequences encoding PKIN at physiolo^cally 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 

15 include PCMV SPORT (life Technologies) and PCR3.1 (Invitrogen, Carlsbad CA), both of which 
contain the cytomegalovirus promoter. 5-10 /xg of recombinant vector are transiently transfected into 
a human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 
formulations or electroporation. 1-2 /ig of an additional plasmid containing sequences encoding a 
marker protein are co-transfected. Expression of a marker protem provides a means to distinguish 

20 transfected cells fi-om nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g.. Green Fluorescent Protein (GFP; 
Clontech), CD64, or a CD64-GFP fiision protein. Flow cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evahiate 
the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 

25 fluorescent molecules that diagnose events preceding or coincident with cell death. These events 
include changes in nuclear DNA content as measured by staining of DNA with propidium iodide; 
changes in cell size and granularity as measured by forward Ught scatter and 90 degree side light 
scatter; down-regulation of DNA synfliesis as measured by decrease in bromodeoxyuridine uptake; 
alterations in egression of cell surface and intracellular proteins as measured by reactivity with 

30 specific antibodies; and alterations in plasma membrane composition as measured by flie binding of 
fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Qrmerod, M.G. (1994) Flow Cvtometrv . Oxford, New York NY. 

The influence of PKIN on gene expression can be assessed using highly purified populations 
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of cells transfected with sequences encoding PKIN and either CD64 or CD64-GFP. CD64 and 
CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success 
S NY). mRNA can be purified from the cells using methods well known by those of skill in the art 
Expression of mRNA encoding PKIN and other genes of interest can be analyzed by northern 
analysis or microarray techniques. 

XIV. Production of PKIN Specific Antibodies 

PKIN substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
10 Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Altematively, the PKIN amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art Methods for 
IS selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art (See, e.g., Ausubel, 199S, supra , ch. 11.) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
. peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 
Aldrich, St Louis MO) by reaction with N-maleimidobenzoyl-N-hydro3^uccinimide ester (MBS) to 
20 increase immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the 
oligopeptide-KLH complex in complete Freund's adjuvant Resulting antisera are tested for 
antipeptide and anti-PKIN activity by, for example, binding the peptide or PKIN to a substrate, 
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat 
anti-rabbit IgG. 

25 XV. Puriricatlon of Naturally Occurring PKIN Using Specific Antibodies 

Naturally occurring or recombinant PKIN is substantially purified by inununoaffinity 
chromatography using antibodies specific for PKIN. An immunoaffinity column is constmcted by 
covalently coupling anti-PKIN antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
30 blocked and washed according to the manufacturer's instructions. 

Media containing PKIN are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of PKIN (e.g., high ionic strength 
buffers in the presence of detergent). The column is ehited under conditions that disrupt 

■ 
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antibody/PKIN binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as 

urea or thiocyanate ion), and PKIN is collected. 

XVI. Identification of Molecules Which Interact with PKIN 

PKIN, or biologically active fragments thereof, are labeled with Bolton-Hunter reagent 
5 (See, e.g., Bolton, A.E. and W.M. Hunter (1973) Biodiem. J. 133:529-539.) Candidate molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled PKIN, washed, and 

■ 

any wells with labeled PKIN complex are assayed. Data obtained using different concentrations of 
PKIN are used to calculate values for the number, affinity, and association of PKIN with the 
candidate molecules. 

10 Alternatively, molecules interacting with PKIN are analyzed using the yeast two-hybrid 

system as described in Fields, S. and O. Song (1989) Nature 340:245-246, or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

PKIN may also be used in the PATHCALUNG process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 

15 between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. 
Patent No. 6,057,101). 

XVIL Demonstration of PKIN Activity 

Generally, protein kinase activity is measured by quantifying the phosphorylation of a protein 

20 substrate by PKIN in the presence of gamma-labeled ^^-ATP. PKIN is incubated with the protein 
substrate, ^^-ATP, and an appropriate kinase buffer. The incorporated into the substrate is 
separated from free ^^-ATP by electrophoresis and the incorporated ^^P is counted using a 
radioisotope counter. The amount of incorporated is proportional to the activity of PKIN. A 
determination of the specific amino acid residue phosphorylated is made by phosphoamino acid 

25 analysis of the hydrolyzed protein. 

In one altemative, protein kinase activity is measured by quantifying the transfer of gamma 
phosphate from adenosine triphosphate (ATP) to a serine, threonine or tyrosine residue in a protein 
substrate. The reaction occurs between a protein kinase sample with a biotinylated peptide substrate 
and gamma ATP. Following the reaction, free avidin in solution is added for binding to the 

30 biotinylated ^^-peptide product The binding san^>le then undergoes a centrifugal ultrafiltration 
process with a membrane which wiU retain the product-avidin complex and allow passage of free 
gamma ^^P-ATP. The reservoir of the centrifuged unit containing the ^^-peptide product as retentate 
is then counted in a scintillation counter. This procedure allows assay of any type of protein kinase 
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sample, depending on the peptide substrate and kinase reaction buffer selected. This assay is provided 
in kit form (ASUA, Affinity Ultrafiltration Separation Assay, Transbio Corporation, Baltimore MD, 
U.S. Patent No. 5,869,275). Suggested substrates and their respective enzymes are as follows: 
Histone HI (Sigma) and p34^^kinase, Annexin I, Angiotensin (Sigma) and EGF receptor kinase, 
5 Annexin n and src kinase, ERKl & ERK2 substrates and MEK, and myelin basic protein and ERK 
(Pearson, J.D. et al. (1991) Methods in Enzymology 200:62-81). 

In another alternative, protein kinase activity of PKIN is demonstrated in vitro in an assay 
containing PKIN, 50^1 of kinase buffer, l\ig substrate, such as myelin basic protein (MBP) or 
synthetic peptide substrates, 1 mM DTT, 10 Jig ATP, and 0.5nCi [y-^^PJATP. The reaction is 

10 incubated at 30 ""C for 30 minutes and stopped by pipetting onto P81 paper. The unincorporated [y- 
^^PJATP is removed by washing and the incorporated radioactivity is measured using a radioactivity 
scintillation counter. Alternatively, the reaction is stopped by heating to 100 °C in the presence of SDS 
loading buffer and visualized on a 12% SDS polyacrylamide gel by autoradiography. Incorporated 
radioactivity is corrected for reactions carried out in the absence of PKIN or in the presence of the 

IS inactive kinase, K38A. 

In yet another alternative, adenylate kinase or guanylate kinase activity may be measured by 
die incorporation of from gamma-labeled -ATP into ADP or GDP using a ganmia radioisotope 
counter. The enzyme, in a kinase buffer, is incubated together widi the appropriate nucleotide 
mono-phosphate substrate (AMP or GMP) and ^^-labeled ATP as the phosphate donor. The 

20 reaction is incubated at 3VC and terminated by addition of trichloroacetic acid. The acid extract is 
neutralized and subjected to gel electrophoresis to separate the mono-, di-, and triphosphonucleotide 
fractions. The diphosphonucleotide fraction is cut out and counted. The radioactivity recovered is 
proportional to the enzyme activity. 

In yet another alternative, other assays for PKIN include scintillation proximiQr assays (SPA), 

2S scintillation plate technology and filter binding assays. Usefiil substrates include recombinant proteins 
tagged with glutathione transferase, or synthetic peptide substrates tagged with biotin. Inhibitors of 
PKIN activity, such as small organic molecules, proteins or peptides, may be identified by such assays. 

Kinase activity of PKIN may be determined by its ability to convert polyphosphate substrate 
(PolyP) to ATP in the presence of ADP. PKIN and Poly P are incubated at 3TC for 40 minutes and 

30 then at 90°C for 2 minutes in a buffer containing 50 mM Tris-HCl, pH 7.4, 40 mM anmionium sulfate, 
4 mM MgClj, and 5 nM ADP. The reaction mixture is diluted 1:100 in 100 mM Tris-HCl (pH 8.0), 4 
mM EDTA, which is then diluted 1:1 in luciferase reaction mixture (ATP Biohiminescence Assay Kit 
CLS n; Boehiinger Mannheim). The ATP generated is then quantitated using a luminometer 
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(Romberg, A. et al. (1999) Annu. Rev. Biochem. 68:89-125; Ault-Riche, D. et aL (1998) J. Bacteriol. 
180:1841-1847). 

Kinase activity of PKIN, as measured by phosphorylation of substrate, may be determined 
using an immune complex kinase assay well imown in the art C0S7 cells are transfected with an 

S expression plasmid constructed from a FLAG tag expression vector (pM£18S-FLAG) containing 
PKIN DNA. A control transfection using vector alone without the PKIN DNA insert is done in 
parallel After 48 hours, the cells are lysed in buffer A (20 mM HEPES-NaOH, pH 7.5, 3 mM 
MgCl2, 100 mM NaGj* 1 dithiothreitol, 1 mM phenyhnethanesulfonyl fluoride, 1 ^g/ml leupeptin, ] 
mM EGTA, 1 mM Na3Vo4, 10 mM NaF, 20 mM ^-glycerophosphate, and 0.5% Triton X-100) and 

10 centrifuged at 14,000 ipnL Supematants are incubated with anti-FLAG antibody (M2 monoclonal 
antibody; Eastman Kodak Co.) in a 50% slurry of protein A-Sepharose (Amersham Pharmacia 
Biotech) for 1.5 hours at 4°C. Immune complexes are precipitated and washed twice in buffer A and 
twice m buffer B (20 mM HEPES-NaOH, pH 7.5, 1 mM dithiothreitol, 10 jiM Na3Vo4, 2 mM P- 
glycerophosphate, 0.1 mM phenyhnethanesulfonyl fhioride, 0.1 ^g/ml leupeptin, 0.1 mM EGTA.) 

15 Precipitates are incubated in buffer B containing 0. 17 mg/ml myelin basic protein (MBP) (Sigma), 20 
^M ATP, and 5 jiCi of [y-^^JATP (NEN life Science Products) at 30*^0 for 20 minutes. The 
reaction is stopped by the addition of 4X Laemmli sample buffer (50 mM Tris-HCl, pH 6.8, 2% SDS, 
30 mM didiiothreitol, and 10% glycerol) and heated at 95 °C for 5 minutes. Proteins are separated by 
SDS-polyacrylamide gel electrophoresis and radioactivity incorporated into MBP is detected by 

20 autoradiography (Nakano, K. et al. (2000) J. Biol. Chem. 275:20533-20539.) 

In yet another alternative, an assay for PanK activity of PKIN includes the enzyme 
preparation method as described in Vallari, D.S. et al., (1987) J. Biol. Chem. 262:2468-247. 
Pantothenate kinase-specific activities in cell lysates are calculated as a function of protein 
concentration with the assay being linear with respect to both time and protein input Protein 

25 concentrations are measured using the Bradford assay using bovine y-globulin as a standard. 

Standard assays contain D-[l-^^C]pantothcnate (45.5 |iM; specific activity 55 mCi/mmol), ATP (2.5 
mM, pH 7.0), MgC^a (2.5 mM), Tris-HCl (0.1 M, pH 7.5), and ISiig of protein from a sohible cell 
extract in a total volume of 40 ^1. The mixture is incubated for 10 miiL at 37 and the reaction is 
stopped by depositing a 30-/il aliquot onto a Whatman DE81 ion-exchange filter disc which is then 

30 washed in three changes of 1% acetic acid in 95% ethanol (25 ml/disc) to remove unreacted 

pantothenate. 4'-Fhosphopantothenate is quantitated by counting &e dried disc in 3 ml of scintillation 
solution (Rock, supraX 

XVIII. Enhancement/Inhibition of Protein Kinase Activity 
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Agonists or antagonists of PKIN activation or inhibition may be tested using assays described 
in section XVn. Agonists cause an increase in PKIN activity and antagonists cause a decrease in 
PKIN activity. 

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvious 
to those skilled in molecular biology or related fields are intended to be within the scope of the 
following claims. 
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Table 5 



rolynucleotiae 


incyte rroject 
ILi: 


Representative Library 


24 


74o304oCB 1 


/"vT^t nv\i 

COLCTUTu3 




71o3o374CBl 


CARDNOTOl 


26 


7480597CB 1 


DRGLNOTOl 


27 


3227248CB 1 


COTRNOTOl 


28 


4207273CB 1 


TESTNOCOl 


29 


7483334CB1 


ADRENOT03 


30 


7483337CB1 


UTRSTMR02 


31 


6035509CB I 


PITUNOT06 


32 


7373485CB 1 


MCLDTXT02 


33 


5734965CB1 


PROSTUS23 


34 


7473788CB1 


BRAINOT19 


35 


3107989CB1 


Pw^ ^^^W ^^^^^^^^^ ^^^^ ^^^^fc 

STOMFET02 


J / 


Zyo3414Ci>l 


6L.UKNU1(J4 


38 


7477139CB1 


PLACFER06 


39 


55009053CB 1 


SINITMEOl 


41 


7483053CB1 


BRAYDIN03 


42 


7483117CB1 


ADREFECOl 


43 


7484498CB1 


BRAITDR03 


44 


7638121CB1 


MUSLTDR02 
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What is claimed is: 



1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group consisting 
5 ofSEQIDNO:l-22, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:l-22, 

c) a biologically active fragment of a polypeptide having an amino acid sequence 
10 selected from the group consisting of SEQ ID NO: 1 -22, and 

d) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-22. 



2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the 
IS group consisting of SEQ ID NO: 1-22. 



3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

20 

5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO:23-44. 



6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
25 polynucleotide of claim 3. 



30 



1. A cell transformed with a recombinant polynucleotide of claim 6. 



8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 



9. A method of producing a polypeptide of claim 1, the method comprising: 
a) culturing a cell under conditions suitable for expression of the polypeptide, wherein 
said cell is transformed with a recombinant polynucleotide, and said recombinant 



124 



wo 02/33099 



PCTAJSOl/47728 



polynucleotide comprises a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide of claim 1, and 
b) recovering the polypeptide so expressed. 

10. A method of claim 9, wherein the polypeptide has an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-22. 

1 1. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from die group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:23-44, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:23-44, 

c) a polynucleotide complementary to a polynucleotide of a), 

d) a polynucleotide complementary to a polynucleotide of b), and 

e) an RNA equivalent of a)-d). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

14. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and 
which probe specifically hybridizes to said target polynucleotide, under conditions 
whereby a hybridization complex is formed between said probe and said target 
polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 
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16. A method of detecting a target polynucleotide in a sample* said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
reaction amplification, and 

S b) detecting the presence or absence of said amplified target polynucleotide or fragment 

thereof, and, optionally, if present, the amount thereof. 

17. A composition comprising a polypeptide of claim 1 and a pharmaceutically acceptable 
excipient 

10 

18. A composition of claim 17, wherein the polypeptide has an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-22. 

19. A method for treating a disease or condition associated with decreased expression of 

IS functional PKIN, comprising administering to a patient in need of such treatment the composition of 
claim 17. 

20. A method of screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

20 a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

21. A composition comprising an agonist compound identified by a method of claim 20 and a 
pharmaceutically acceptable excipient. 

25 

22. A method for treating a disease or condition associated with decreased expression of 
functional PKIN, comprising administering to a patient in need of such treatment a composition of 
claim 21. 

30 23. A method of screening a compound for effectiveness as an antagonist of a polypeptide of 

claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 
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24. A composition comprising an antagonist compound identified by a method of claim 23 and 
a phannaceutically acceptable excipient. 

25. A method for treating a disease or condition associated with overexpression of functional 
S PIGN, comprising administering to a patient in need of such treatment a composition of claim 24. 

26. A method of screening for a compound that specifically binds to the polypeptide of claim 
I J the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
10 conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby 
identifying a compound diat specifically binds to the polypeptide of claim 1. 



27. A method of screening for a compound that modulates the activity of the polypeptide of 
IS claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test 
compound, and 

20 c) comparing the activity of the polypeptide of claim 1 in the presence of the test 

compound with die activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 
presence of the test compound is indicative of a compound that modulates the activity 
of the polypeptide of claim 1. 

25 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under 
30 conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying 
amounts of the compound and in the absence of the compound. 
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29. A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising 
at least 20 contiguous nucleotides of a polynucleotide of claim 12 under conditions 
whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide comprising a 
polynucleotide sequence of a polynucleotide of claim 12 or fragment thereof, 

c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample with 
the amount of hybridization complex in an untreated biological sample, wherein a 
difference in the amount of hybridization complex in the treated biological sample is 
indicative of toxicity of the test compound. 

30. A diagnostic test for a condition or disease associated with the expression of PKIN in a 
biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 1 1, under conditions suitable 
for the antibody to bind the polypeptide and form an antibodyipolypeptide complex, 
and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 

31. The antibody ofclaim 11, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab' )2 fragment, or 

e) a humanized antibody. 

32. A composition comprising an antibody of claim 11 and an acceptable excipient 

33. A method of diagnosing a condition or disease associated with the expression of PKIN in 
a subject, comprising admioistering to said subject an effective amount of the composition of claim 32. 
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34. A composition of claim 32, wherein the antibody is labeled. 

35. A method of diagnosing a condition or disease associated with the expression of PKIN in 
a subject, comprising administering to said subject an effective amount of the composition of claim 34. 

36. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 
1 1» the method comprising: 

a) immunizing an animal widi a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-22, or an immunogenic fragment thereof, 
under conditions to elicit an antibody response, 

b) isolating antibodies from said animal, and 

c) screening die isolated antibodies with the polypeptide, thereby identifying a polyclonal 
antibody which binds specifically to a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-22. 

37. A polyclonal antibody produced by a method of claim 36. 

38. A composition comprising the polyclonal antibody of claim 37 and a suitable canier. 

39. A method of making a monoclonal antibody with the specificity of the antibody of claim 
11, the method comprising: 

a) immunizing an animal with a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-22, or an immunogenic fragment thereof, 
under conditions to eUcit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
antibody-producing hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating fi'om the culture monoclonal antibody which binds specifically to a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ 
IDNO:l-22. 

40. A monoclonal antibody produced by a method of claim 39. 
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41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 

42. The antibody of claim 11, wherein the antibody is produced by screening a Fab expression 

library. 

5 

43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

44. A method of detecting a polypeptide having an amino acid sequence selected from the 
10 group consisting of SEQ ID NO: 1-22 in a sample, the method comprising: 

a) incubating the antibody of claim 1 1 with a sample under conditions to allow specific 
binding of the antibody and die polypeptide, and 

b) detecting specific binding, wherein specific binding indicates die presence of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ 

15 ID NO:l-22 in the sample. 

45. A method of purifying a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-22 from a sample, the method comprising: 

a) incubating the antibody of claim 1 1 with a sample under conditions to allow specific 
20 binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide having 
an amino acid sequence selected from die group consisting of SEQ ID NO: 1-22. 



46. A microarray wherein at least one element of the microarray is a polynucleotide of claim 

25 13. 



47. A method of generating a transcript image of a sample which contains polynucleotides, 
the method comprising: 

a) labeling the polynucleotides of die sample, 
30 b) contacting the elements of the microarray of claim 46 with die labeled polynucleotides 

of the sample under conditions suitable for the formation of a hybridization complex, 
and 

c) quantifying the expression of the polynucleotides in the sample. 
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48. An array comprising different nucleotide molecules affixed in distinct physical locations 
on a solid substrate, wherein at least one of said nucleotide molecules comprises a first oligonucleotide 
or polynucleotide sequence specifically hybridizable with at least 30 contiguous nucleotides of a target 
polynucleotide, and wherein said target polynucleotide is a polynucleotide of claim 12. 

5 

49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target polynucleotide. 

50. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
10 completely complementary to at least 60 contiguous nucleotides of said target polynucleotide. 

51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to said target polynucleotide. 

IS 52. An array of claim 48, which is a microarray. 

53. An array of clain^ 48, further comprising said target polynucleotide hybridized to a 
nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

20 54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 

said solid substrate. 

55. An array of claim 48, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct physical 

25 location have the same sequence, and each distinct physical location on the substrate contains 

nucleotide molecules having a sequence which differs from the sequence of nucleotide molecules at 
another distinct physical location on the substrate. 

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID N0:1. 

30 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID N0:2. 

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 
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59. A polypeptide of claim 



60. A polypeptide of claim 



61. A polypeptide of claim 



62. A polypeptide of claim 

63. A polypeptide of claim 

64. A polypeptide of claim 

65. A polypeptide of claim 

66. A polypeptide of claim 

67. A polypeptide of claim 

68. A polypeptide of claim 

69. A polypeptide of claim 

70. A polypeptide of claim 

71. A polypeptide of claim 

72. A polypeptide of claim 

73. A polypeptide of claim 



74. A polypeptide of claim 

75. A polypeptide of claim 
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, comprising the amino acid sequence of SEQ ID N0:4. 



, comprising the amino acid sequence of SEQ ID N0:5. 
, comprising the amino acid sequence of SEQ ID N0:6. 



, comprising the amino acid sequence of SEQ ID N0:7. 

, comprising the amino acid sequence of SEQ ID N0:8. 

» comprising the amino acid sequence of SEQ ID N0:9. 

, comprising the amino acid sequence of SEQ ID NO: 10. 

, comprising the amino acid sequence of SEQ ID N0:1 1. 

, comprising the amino acid sequence of SEQ ID NO: 12. 

» comprising the amino acid sequence of SEQ ID NO: 13. 

y comprising the amino acid sequence of SEQ ID NO: 14. 

» comprising the amino acid sequence of SEQ ID N0:15. 

, comprising die amino acid sequence of SEQ ID NO: 16. 

, comprising the amino acid sequence of SEQ ID N0:17. 

, comprising the amino acid sequence of SEQ ID NO: 18. 

, comprising the amino acid sequence of SEQ ID NO: 19. 

, comprising the amino acid sequence of SEQ ID NO:20. 
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76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

78. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:23. 

79. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:24. 

80. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:25. 
SLA polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:26. 

82. A polynucleotide of claim 12, comprising die polynucleotide sequence of SEQ ID NO:27, 

83. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:28. 

84. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:29. 

85. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:30. 

86. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID N0:3 1 . 

87. A polynucleotide of claim 12, comprising flie polynucleotide sequence of SEQ ID NO:32. 

88. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:33. 

89. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:34. 

90. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:35. 

91. A polynucleotide of claim 12, comprising flie polynucleotide sequence of SEQ ID NO:36. 

92. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:37. 
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93. A polynucleotide of claim 12, comprising the polynucleotide sequence of S£Q ID NO:38. 

94. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:39. 

95. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:40. 

96. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID N0:41. 

97. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:42. 

98. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:43. 

99. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:44. 
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<210> 1 
<211> 337 
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<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7482896CD1 

<400> 1 

Met Thr Asn Asn Ser Gly Ser Lys Ala Glu Leu Val Val Gly Gly 
15 10 15 
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Val Tyr Leu 
Lys Leu Olu 
Ser Lys Leu 
Met His Trp 
Asp Leu Leu 
Arg Arg Phe 
lie Ser Arg 
Asp lie Lys 
Asn Lys Leu 
Asp Asn Arg 
Leu lie Gly 
lie Glu Gin 
Phe Met Tyr 
Ala Met Thr 
Met Ser Thr 
Phe Ala Met 
Val Pro Asp 
Arg Thr Leu 
Leu Lys Gin 
Gly Gin Gin 
Lys Asn Asn 



Gly He Thr 

35 

Ser Gin Lys 
50 

Tyr Thr He 
65 

Tyr Gly Gin 
80 

Gly Pro Ser 
95 

Thr Met Lys 

110 
He Glu Tyr 

125 
Pro Asp Asn 

140 
Phe Leu He 

155 
Thr Arg Gin 

170 
Thr Val Arg 

185 
Ser Arg Arg 

200 
Phe Asn Arg 

215 
Lys Lys Gin 

230 
Pro Val Glu 

245 
Tyr Leu Asn 

260 
Tyr Met Tyr 

275 
Asn His Gin 

290 
Lys Ala Ala 

305 
Ala Gin Thr 

320 
Val Lys Asp 

335 



Thr Thr Asn 
Val Lys His 
Leu Gin Gly 
Glu Lys Asp 
Leu Glu Asp 
Thr Val Leu 
Val His Thr 
Phe Leu Met 
Asp Phe Gly 
His He Pro 
Tyr Ala Ser 
Asp Asp Met 
Thr Ser Leu 
Lys Tyr Glu 
Val Leu Cys 
Tyr Cys Arg 
Leu Arg Gin 
Tyr Asp Tyr 
Gin Gin Ala 
Gin Thr Gly 
Asn ■ 



Gly Glu Asp 
40 

Pro Gin Leu 
55 

Gly Val Gly 
70 

Asn Asn Val 
85 

Leu Phe Asn 
100 

Met Leu Ala 
115 

Lys Asn Phe 
130 

Gly Thr Gly 
145 

Leu Ala Lys 
160 

Tyr Arg Glu 

175 

He Asn Ala 
190 

Glu Ser Leu 
205 

Pro Trp Gin 

220 

Lys He Ser 
235 

Lys Gly Phe 
250 

Gly Leu Arg 

265 

Leu Phe Arg 
280 

Thr Phe Asp 
295 

Ala Ser Ser 
310 

Lys Gin Thr 
325 



Val Ala Val 

45 

Leu Tyr Glu 
60 

He Pro His 
75 

Leu Val Met 
90 

Phe Cys Ser 
105 

Asp Gin Met 
120 

Leu His Arg 
135 

Arg His Cys 

150 

Lys Tyr Arg 
165 

Asp Lys His 

180 

His Leu Gly 
195 

Gly Tyr Val 
210 

Gly Leu Arg 

225 

Glu Lys Lys 
240 

Pro Ala Olu 
255 

Phe Glu Glu 

270 

He Leu Phe 
285 

Trp Thr Met 
300 

Ser Gly Gin 
315 

Glu Lys Asn 
330 



<210> 2 

<211> 475 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7483046CD1 



<400> 2 

Met Glu Arg Arg Leu 
1 5 
Ala Gly Gly Cys Pro 



Arg Ala Leu Glu Gin 

10 

Gly Leu Asp Gly Leu 



Leu Ala Arg Gly Glu 

15 

Leu Asp Leu Leu Leu 
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20 

Ala Leu His His Glu 

35 

Ser Val Ala Gin Phe 

50 

Val Lys Glu Leu Arg 

65 

Val lie Qly Arg Qly 

80 

Arg Asp Thr Gly Gin 

95 

Qlu Met Leu Lys Arg 

110 

Asp Val Leu Val Lys 

125 

Tyr Ala Phe Gin Asp 

140 

Tyr Ala Qly Gly Asp 

155 

Arg Leu Pro Pro Glu 

170 

Leu Ala lie His Ser 

185 

Val Lys Pro Asp Asn 

200 

Leu Ala Asp Phe Gly 

215 

Val Asp Ser Ser Val 

230 

Glu lie Leu Gin Ala 

245 

Gin Cys Asp Trp Trp 

260 

Phe Gly Qlu Thr Pro 

275 

Gly Lys lie Met Asn 

290 

Val Pro Asp Val Pro 

305 

Leu Cys Arg Gin Qlu 

320 

Phe Arg Asn His Pro 

335 

Ala Ser Ser Thr Ala 

350 

Asp Thr Ser Asn Phe 

365 

Gly Thr Leu Pro Pro 

380 

Leu Pro Phe Val Gly 

395 

Glu Ser Ser Ser Glu 

410 

Cys Leu Glu Gin Glu 

425 

Ala Leu His Ala Pro 



25 

Leu Ser Ser Gly Pro 

40 

Leu Ser Trp Ala Ser 

55 

Leu Gin Arg Asp Asp 

70 

Ala Phe Gly Glu Val 

85 

lie Phe Ala Met Lys 

100 

Ala Glu Thr Ala Cys 

115 

Gly Asp Ser Arg Trp 

130 

Glu Glu Tyr Leu Tyr 

145 

Leu Leu Thr Leu Leu 

160 

Leu Ala Gin Phe Tyr 

175 

Leu His Gin Leu Gly 

190 

Val Leu Leu Asp Val 

205 

Ser Cys Leu Arg Leu 

220 

Ala Val Gly Thr Pro 

235 

Met Glu Glu Gly Lys 

250 

Ser Leu Gly Val Cys 

265 

Phe Tyr Ala Glu Ser 

280 

His Glu Asp His Leu 

295 

Ala Ser Ala Gin Asp 

310 

Glu Arg Leu Gly Arg 

325 

Phe Phe Glu Gly Val 

340 

Pro Tyr lie Pro Glu 

355 

Asp Val Asp Asp Asp 

370 

Pro Ser His Gly Ala 

385 

Phe Thr Tyr Thr Ser 

400 

Ala Trp Ala Ala Leu 

415 

Lys Val Glu Leu Ser 

430 

Thr Asp His Arg Glu 



30 

Leu Arg Arg Glu Arg 

45 

Pro Phe Val Ser Lys 

60 

Phe Glu lie Leu Lys 

75 

Thr Val Val Arg Gin 

90 

Met Leu His Lys Trp 

105 

Phe Arg Glu Glu Arg 

120 

Val Thr Thr Leu His 

135 

Leu Val Met Asp Tyr 

150 

Ser Arg Phe Glu Asp 

165 

Leu Ala Glu Met Val 

180 

Tyr Val His Arg Asp 

195 

Asn Gly His lie Arg 

210 

Asn Thr Asn Qly Met 

225 

Asp Tyr lie Ser Pro 

240 

Gly His Tyr Gly Pro 

255 

Ala Tyr Glu Leu Leu 

270 

Leu Val Glu Thr Tyr 

285 

Gin Phe Pro Pro Asp 

300 

Leu lie Arg Gin Leu 

315 

Gly Gly Leu Asp Asp 

330 

Asp Trp Glu Arg Leu 

345 

Leu Arg Gly Pro Met 

360 

Thr Leu Asn His Pro 

375 

Phe Ser Gly His His 

390 

Gly Ser His Ser Pro 

405 

Glu Arg Lys Leu Gin 

420 

Arg Lys His Gin Glu 

435 

Leu Glu Gin Leu Arg 
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440 

Lys Qlu Val Gin Thr 

455 

Ala His Pro His Pro 

470 



445 

Leu Arg Asp Arg Leu 

460 

Leu Leu Olu Phe Leu 

475 



450 

Pro Gly lie Pro Ser 

465 



<210> 3 

<211> 675 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 71636374CD1 



<400> 3 

Met Thr Thr Ser His 
1 5 
Ser Olu Val Lys Asn 

20 

Lys His Arg Glu Met 

35 

Arg Met Met Pro lie 

50 

Gin Gin Gin Olu Asp 

65 

Lys Gin Glu Leu Asp 

80 

Ala Gin lie Pro Pro 

95 

Thr Glu Glu Oly He 

110 

He Leu Olu He Glu 

125 

His Thr Leu Val Asp 

140 

Leu Gin Leu Val Gin 

155 

His Asn Ala He Thr 

170 

Pro Leu He Ser Asn 

185 

Val Leu Lys Pro Val 

200 

Leu Leu Asn Thr Pro 

215 

Lys Val Ala Glu Gin 

230 

Arg Val Tyr Glu Ser 

245 

He Val Arg He Olu 

260 

Val Arg Asn Olu Met 

275 

Gly Gly Ala Ala Glu 

290 



Met Asn Gly His Val 

10 

Val Asp Leu Ala Ser 

25 

Ala Val Asp Cys Pro 

40 

Arg Arg Ser Ala Gin 

55 

Met Arg Arg Arg Arg 

70 

Leu Asn Ser Ser Met 

85 

Lys Thr Gly He Asp 

100 

Val Leu Glu Ser Pro 

115 

Asp Leu Phe Ser Ser 

130 

Ser Gin Ser Gin Glu 

145 

Asn Lys Asp Phe Gin 

160 

Val His Met Asn Lys 

175 

Ala Gin Asp Leu Ala 

190 

His His Lys Glu Gly 

205 

His He Gin Ala Leu 

220 

Glu Met Gin Leu Glu 

235 

He Gly Gin Tyr Gly 

250 

Lys Ala Arg Asp He 

265 

Asp Ser Val He He 

280 

Lys Ser Gly Leu Leu 

295 



Thr Glu Glu Ser Asp 

15 

Pro Glu Glu His Gin 

30 

Gly Asp Leu Gly Thr 

45 

Leu Glu Arg He Arg 

60 

Glu Glu Glu Oly Lys 

75 

Arg Leu Lys Lys Leu 

90 

Asn Pro Met Phe Asp 

105 

His Tyr Ala Val Lys 

120 

Leu Lys His He Gin 

135 

Asp He Ser Leu Leu 

150 

Asn Ala Phe Lys He 

165 

Ala Ser Pro Pro Phe 

180 

Gin Glu Val Gin Thr 

195 

Gin Glu Leu Thr Ala 

210 

Leu Leu Ala His Asp 

225 

Pro He Thr Asp Glu 

240 

Gly Glu Thr Val Lys 

255 

Pro Leu Gly Ala Thr 

270 

Ser Arg He Val Lys 

285 

His Glu Gly Asp Glu 

300 
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Val Leu 


Glu lie 


Asn 


Gly He Glu He 


Arg 


Gly Lys 


Asp 


Val 


Asn 






305 




310 










3l!> 


Glu Val 


Phe Asp 


Leu 


Leu Ser Asp Met 


His 


Gly Thr Leu Thr 


File 






320 




325 










■3 T rt 

330 


Val Leu 


lie Pro 


Ser 


Gin Gin He Lys 


Pro 


Pro 


Pro 


Ala 


Lys 


Glu 






335 




340 










345 


Thr Val 


lie His 


Val 


Lys Ala His Phe 


Asp 


Tyr 


Asp 


Pro 


Ser 


Asp 






350 




355 










^ ^ f\ 
360 


Asp Pro 


Tyr Val 


Pro 


Cys Arg Glu Leu Gly 


Leu 


Ser 


Phe 


Gin 


Lys 






365 




370 










375 


Gly Asp 


lie Leu 


His 


Val He Ser Gin 


Glu 


Asp 


Pro 


Asn 


Trp 


Trp 






380 




385 










390 


Gin Ala 


Tyr Arg 


Glu 


Gly Asp Glu Asp Asn 


Gin 


Pro 


Leu 


Ala 


Gly 






395 




400 










405 


Leu Val 


Pro Gly 


Lys 


Ser Phe Gin Gin 


Gin 


Arg 


Glu 


Ala 


Met 


Lys 






410 




415 










420 


Gin Thr 


lie Glu 


Glu 


Asp Lys Glu Pro 


.Glu 


Lys 


Ser Gly Lys 


Leu 






425 




430 










435 


Trp Cys 


Ala Lys 


Lys 


Asn Lys Lys Lys 


Arg 


Lys 


Lys 


Val 


Leu 


Tyr 






440 




445 










450 


Asn Ala 


Asn Lys 


Asn 


Asp Asp Tyr Asp Asn 


Glu 


Glu 


He 


Leu 

• 


Thr 






455 




460 










465 


Tyr Glu Glu Met 


Ser 


Leu Tyr His Gin 


Pro 


Ala 


Asn 


Arg 


Lys 


Arg 






470 




475 










480 


Pro lie 


lie Leu 


He 


Gly Pro Gin Asn 


Cys 


Gly Gin Asn 


Glu 


Leu 






485 




490 










495 


Arg Gin Arg Leu 


Met 


Asn Lys Glu Lys 


Asp 


Arg 


Phe 


Ala 


Ser 


Ala 






500 




505 










510 


Val Pro 


His Thr 


Thr 


Arg Ser Arg Arg 


Asp 


Gin 


Glu 


Val 


Ala 


Gly 






515 




520 










^ 

525 


Arg Asp 


Tyr His 


Phe 


Val Ser Arg Gin 


Ala 


Phe 


Glu 


Ala 


Asp 


He 






530 




535 










540 


Ala Ala Gly Lys 


Phe 


He Glu His Gly Glu 


Phe 


Glu 


Lys 


Asn 


Leu 






545 




550 










^ ^ ^ 

555 


Tyr Gly Thr Ser 


He 


Asp Ser Val Arg 


Gin 


Val 


He 


Asn 


Ser 


Gly 






560 




565 










570 


Lys lie 


Cys Leu 


Leu 


Ser Leu Arg Thr 


Gin 


Ser 


Leu 


Lys 


Thr 


Leu 






575 




580 










585 


Arg Asn 


Ser Asp 


Leu 


Lys Pro Tyr He 


He 


Phe 


He 


Ala 


Pro 


Pro 






590 




595 










600 


Ser Gin 


Glu Arg 


Leu 


Arg Ala Leu Leu 


Ala 


Lys 


Glu Gly Lys 


Asn 






605 




610 










615 


Pro Lys 


Pro Glu 


— ^ 

Glu 


Leu Arg Glu He 


He 


Glu 


Lys Thr Arg 


Glu 






620 




625 










630 


Met Glu 


Gin Asn 


Asn 


Gly His Tyr Phe Asp 


Thr 


Ala 


He 


Val 


Asn 






635 




640 










645 


Ser Asp 


Leu Asp 


Lys 


Ala Tyr Gin Glu 


Leu 


Leu 


Arg 


Leu 


He 


Asn 






650 




655 










660 


Lys Leu Asp Thr 


Glu 


Pro Gin Trp Val 


Pro 


Ser Thr Trp Leu 


Arg 






665 




670 










675 



<210> 4 
<211> 835 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> inisc_f eature 
<223> Incyte ID No: 



7480597CD1 



<400> 4 



Met 
1 

Ser 
Asp 
Arg 
Arg 
Arg 
Val 
Glu 
Gly 
Ser 
Leu 
Asn 
Glu 
Lys 
Arg 
Leu 
Cys 
Cys 
Ser 
Lys 
Leu 
He 
Arg 
Leu 
Val 
Ala 



Ala Glu Gly Lys Glu Gly Gin Val Pro Ser Tyr Met Asp Gly 

5 10 15 

Arg Gin Arg Glu Asn Glu Glu Asp Ala Lys Ala Glu Thr Pro 

20 25 30 

Val Thr He Arg Ser Tyr Glu He Tyr Ser Leu Pro Trp Asn 

35 40 45 

Gin Gin Gly Leu Cys Asp His Ser Leu Lys Tyr Leu Ser Ser 

50 55 60 

He Thr Glu Arg Lys Leu Gin Gly Ser Trp Leu Pro Ala Ser 

65 70 75 

Gly Asn Leu Glu Lys Pro Phe Leu Gly Pro Arg Gly Pro Val 

80 85 90 

Pro Leu Phe Cys Pro Arg Asn Gly Leu His Ser Ala His Pro 

95 100 105 

Asn Ser Pro Leu Lys Pro Arg Val Val Thr Val Val Lys Leu 

110 115 120 

Gly Gin Arg Pro Arg Lys He Thr Leu Leu Leu Asn Arg Arg 

125 130 135 

Val Gin Thr Phe Glu Gin Leu Leu Ala Asp He Ser Glu Ala 

140 145 150 

Gly Ser Pro Arg Trp Lys Asn Asp Arg Val Arg Lys Leu Phe 

155 160 165 

Leu Lys Gly Arg Glu He Arg Ser Val Ser Asp Phe Phe Arg 

170 175 180 

Gly Asp Ala Phe He Ala Met Gly Lys Glu Pro Leu Thr Leu 

185 190 195 

Ser He Gin Val Ala Val Glu Glu Leu Tyr Pro Asn Lys Ala 

200 205 210 

Ala Leu Thr Leu Ala Gin His Ser Arg Ala Pro Ser Pro Arg 

215 220 225 

Arg Ser Arg Leu Phe Ser Lys Ala Leu Lys Gly Asp His Arg 

230 235 240 

Gly Glu Thr Glu Thr Pro Lys Ser Cys Ser Glu Val Ala Gly 

245 250 255 

Lys Ala Ala Met Arg His Gin Gly Lys He Pro Glu Glu Leu 

260 265 270 

Leu Asp Asp Arg Ala Arg Thr Gin Lys Lys Trp Gly Arg Gly 

275 280 285 

Trp Glu Pro Glu Pro Ser Ser Lys Pro Pro Arg Glu Ala Thr 

290 295 300 

Glu Glu Arg His Ala Arg Gly Glu Lys His Leu Gly Val Glu 

305 310 315 

Glu Lys Thr Ser Gly Glu He He Arg Cys Glu Lys Cys Lys 

320 325 330 

Glu Arg Glu Leu Gin Gin Ser Leu Glu Arg Glu Arg Leu Ser 

335 340 345 

Gly Thr Ser Glu Leu Asp Met Gly Lys Gly Pro Met Tyr Asp 

350 355 360 

Glu Lys Leu Val Arg Thr Arg Ser Cys Arg Arg Ser Pro Glu 

365 370 375 

Asn Pro Ala Ser Gly Glu Glu Gly Trp Lys Gly Asp Ser His 
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Arg Ser Ser Pro 
Lys Ser Met Asp 



Ser His Ala Gin 



Val Leu Pro Val 



Thr Arg Pro Met 



Glu His Gin Ala 



Glu Lys Glu Ala 
Arg Arg Met Thr 
Glu Pro Lys Thr 
Qly Arg Lys Pro 
Lys His Tyr Glu 
Val Val Lys Glu 
Met Lys lie lie 
Val Asp Ser Glu 
lie Val Lys Leu 



Leu lie Leu Glu 



lie Glu Ser Val 



Met Asp Leu CyB 
Val His Arg Asp 
Glu Asp Lys Ser 
Lys His Val Val 
Tyr Val Ala Pro 
Val Asp Met Trp 
Gly Phe Pro Pro 



Phe Asn He He 



Trp Asp Asn He 
Leu Val Val Asp 



Gin His Pro Trp 



380 
Arg 
395 
Lys 
410 
Gly 
425 
Thr 
440 
Ser 
455 
Gly 
470 
Glu 
485 
Leu 
500 
Arg 
515 
Arg 
530 
Thr 
545 
Cys 
560 
Asp 
575 
He 
590 
His 
605 
Tyr 
620 
Lys 
635 
Lys 
650 
Leu 
665 
Thr 
680 
Arg 
695 
Glu 
710 
Ala 
725 
Phe 
740 
Gin 
755 
Ser 
770 
Pro 
785 
He 



Asn Pro Thr Gin 



Lys 
Ala 
Glu 
Arg 
Phe 
Lys 
Arg 
Pro 
Pro 
Qly 
Arg 
Lys 
Leu 
Glu 
Val 
Phe 
Ala 
Lys 
Thr 
Pro 
He 
Ala 
Arg 
Leu 
Asp 
Lys 
Glu 



Glu Asp Arg 
Ala 



Ala Lys 
Glu Gly 
Ser Lys 
Glu Lys 
Glu Lys 
Asp Asp 
Glu Glu 
Met Gly 
Arg Val 
His Arg 
Ser Arg 
He He 
Val Tyr 
Gin Gly 
Pro Glu 
Leu Val 
Pro Glu 
Leu Lys 
He Phe 
Leu Ser 
Gly Val 
Ser Pro 
Gly His 
Ala Ala 
Lys Arg 
Thr Ala 



Leu 

His 

Leu 

Lys 

Gin 

Asn 

He 

He 

Glu 

Leu 

Gin 

Glu 

Gly 

Pro 

His 

Asn 

Leu 

Thr 

Glu 

He 

Glu 

Phe 

Lys 

Tyr 

Gly 



385 
Glu 
400 
Qly 
415 
Lys 
430 
Arg 
445 
Gly 
460 
Arg 
475 
Pro 
490 
Pro 
505 
Lys 
520 
He 
535 
Gly 
550 
Thr 
565 
Lys 
580 
Ser 
595 
Thr 
610 
Asp 
625 
Asp 
640 
Met 
655 
Leu 
670 
Ala 
685 
Val 
700 
Lys 
715 
Leu 
730 
Arg 
745 
Glu 
760 
Asp 
775 
Thr 
790 
Lys 



Leu 
Pro 
Lys 
Glu 
Gly 
Arg 
Cys 
Ala 
Pro 
Ala 
Asp 
Arg 
Gly 
Leu 
Asp 
Leu 
Ala 
His 
Leu 
Asp 
Cys 
Qly 
Tyr 
Asp 
Phe 
Leu 
Ala 
Thr 



Arg Arg Pro 
Glu Asp Gin 
Asp Leu Val 
Val Lys Lys 
Trp Leu Leu 
Thr Arg Gly 
Met Ser Gly 
Lys Leu Glu 
Glu Arg Pro 
Ala Asn Val 
Gly Asn Phe 
Gin Ala 



Tyr 
Asp 
Pro 
Met Glu He 



Lys Glu 
Ser His 



Phe Asp Ala 
Ala Leu Met 
Asp Lys Ser 
Val Gin Arg 
Phe Gly Leu 
Gly Thr Pro 
Tyr Gly Leu 
He Leu Leu 
Gin Asp Glu 
Leu Pro Pro 
Val Ser Arg 
His Gin Val 
Asn Thr Val 



390 

Ser 

405 

Glu 

420 

Glu 

435 

Asp 

450 

Arg 

465 

Glu 

480 

Gly 

495 

Lys 

510 

Ser 

525 

Glu 

540 

Ala 

555 

Ala 

570 

Met 

585 

Asn 

600 

Tyr 

615 

He 

630 

He 

645 

He 

660 

Asn 

675 

Ala 

690 

Thr 

705 

Glu 

720 

Cys 

735 

Leu 

750 

Tyr 

765 

Leu 

780 

Leu 

795 

Lys 
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800 805 810 

Arg Gin Lys Gin Val Ser Pro Ser Ser Glu Gly His Phe Arg Ser 

815 820 825 

Qln His Lys Arg Val Val Glu Gin Val Ser 

830 835 



<210> 5 
<211> 373 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3227248CD1 



<400> 5 
Met Lys Leu 

. 1 

Met Asp lie 
Lys Asp lie 
Lys Ser lie 
Thr Gly He 
Cys Gly Arg 
Ala Met His 
Ser Leu His 
Phe Glu Glu 
Leu Asp Glu 
Ser Val Gly 
Pro Thr Asn 
Asn Pro Tyr 
He Leu Ala 
Thr Ser Leu 
Thr Gly Cys 
Gly Asp Ser 
Gly Asp Tyr 
Ser Phe Gly 
Lys Glu Asp 



He Asn Gly 
5 

Gly Gly Thr 
20 

Thr Ala Glu 
35 

Arg Lys Tyr 
50 

Arg Asp Val 

65 

Lys Gly Asn 
80 

Arg Phe He 
95 

Thr Thr Leu 

110 
Asp Phe Arg 

125 
Leu Asp Cys 

140 
Phe Asn Gly 

155 
Pro Glu Leu 

170 
Pro Met Leu 

185 

Val Tyr Ser 

200 
Gly Gly Gly 

215 
Glu Thr Phe 

230 
Thr Asn Val 

245 
Glu Arg Phe 

260 
Asn Met Met 

275 
Leu Ala Arg 

290 



Lys Lys Gin 
Leu Val Lys 
Glu Glu Gin 
Leu Thr Ser 
His Leu Glu 
Leu His Phe 
Gin Met Gly 
Cys Ala Thr 
Met He Ala 
Leu He Gin 
Lys Pro Glu 
Cys Gin Lys 
Leu Val Asn 
Lys Asp Asn 
Thr Phe Leu 
Glu Glu Ala 
Asp Lys Leu 
Gly Leu Gin 
Ser Lys Glu 
Ala Thr Leu 



Thr Phe Pro 
10 

Leu Val Tyr 
25 

Glu Glu Val 
40 

Asn Thr Ala 
55 

Leu Lys Asn 

70 

He Arg Phe 
85 

Ser Glu Lys 
100 

Gly Gly Gly 
115 

Asp Leu Gin 
130 

Gly Leu Leu 
145 

Cys Tyr Tyr 
160 

Lys Pro Tyr 
175 

Met Gly Ser 
190 

Tyr Lys Arg 
205 

Gly Leu Cys 
220 

Leu Glu Met 

235 

Val Lys Asp 
250 

Gly Ser Ala 
265 

Lys Arg Asp 

280 

Val Thr He 
295 



Trp Phe Gly 
15 

Phe Glu Pro 
30 

Glu Asn Leu 
45 

Tyr Gly Lys 
60 

Leu Thr Met 
75 

Pro Ser Cys 
90 

Asn Phe Ser 
105 

Ala Phe Lys 

120 

Leu His Lys 
135 

Tyr Val Asp 
150 

Phe Glu Asn 

165 

Cys Leu Asp 
180 

Gly Val Ser 
195 

Val Thr Qly 
210 

Cys Leu Leu 
225 

Ala Ala Lys 
240 

He Tyr Gly 
255 

Val Ala Ser 
270 

Ser He Ser 
285 

Thr Asn Asn 
300 
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He Gly Ser He Ala Arg Met Cys Ala Leu Asn Glu Asn He Asp 

305 310 315 

Arg Val Val Phe Val Qly Asn Phe Leu Arg He Asn Met Val Ser 

320 325 330 

Met Lys Leu Leu Ala Tyr Ala Met Asp Phe Trp Ser Lys Gly Qln 

335 340 345 

Leu Lys Ala Leu Phe Leu Glu His Glu Gly Tyr Phe Gly Ala Val 

350 355 360 

Gly Ala Leu Leu Glu Leu Phe Lys Met Thr Asp Asp Lys 

365 370 

<210> 6 

<211> 735 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 4207273CD1 

<400> 6 



Met 


Pro 


Gin 


He 


Ala 


Lys 


Lys 


Gin 


Ser 


Thr 


His 


Arg 


Thr 


Gin 


Lys 


1 








5 










10 










15 


Pro 


Lys 


Lys 


Gin 


Ser 


Phe 


Pro 


Cys 


He 


Cys 


Lys 


Asn 


Pro Gly Thr 










20 










25 










30 


Gin 


Lys 


Ser Cys 


Val 


Pro 


Leu 


Ser 


Val 


Gin 


Pro 


Thr 


Glu 


Pro 


Arg 










35 










40 










45 


Leu 


Asn 


Tyr 


Leu Asp 


Leu 


Lys 


Tyr 


Ser Asp Met 


Phe 


Lys 


Glu 


He 










50 










55 










60 


Asn 


Ser 


Thr 


Ala 


Asn Gly 


Pro Gly 


He Tyr Glu 


Met 


Phe Gly Thr 










65 










70 










75 


Pro 


Val 


Tyr 


Cys 


His 


Val 


Arg 


Glu 


Thr 


Glu 


Arg 


Asp 


Glu 


Asn 


Thr 










80 










85 










90 


Tyr Tyr 


Arg 


Glu 


He Cys 


Ser 


Ala 


Pro 


Ser Gly Arg 


Arg 


He 


Thr 










95 










100 










105 


Asn 


Lys 


Cys 


Arg 


Ser 


Ser 


His 


Ser 


Glu 


Arg 


Lys 


Ser 


Asn 


He 


Arg 










110 










115 










120 


Thr 


Arg 


Leu 


Ser 


Gin 


Lys 


Lys 


Thr 


His 


Met 


Lys 


Cys 


Pro 


Lys 


Thr 










125 










130 










135 


Ser 


Phe 


Gly 


He 


Lys 


Gin 


Glu 


His 


Lys 


Val 


Leu 


He 


Ser 


Lys 


Glu 










140 










145 










150 


Lys 


Ser 


Ser 


Lys 


Ala 


Val 


His 


Ser 


Asn 


Leu 


His 


Asp 


He 


Glu 


Asn 










155 










160 










165 


Gly Asp 


Gly 


He 


Ser 


Glu 


Pro 


Asp 


Trp 


Gin 


He 


Lys 


Ser 


Ser 


Gly 










170 










175 










180 


Asn 


Glu 


Phe 


Leu 


Ser 


Ser 


Lys 


Asp 


Glu 


He 


His 


Pro 


Met 


Asn 


Leu 










185 










190 










195 


Ala 


Gin 


Thr 


Pro 


Glu 


Gin 


Ser 


Met 


Lys 


Gin 


Asn 


Glu 


Phe 


Pro 


Pro 










200 










205 










210 


Val 


Ser 


Asp 


Leu 


Ser 


He 


Val 


Glu 


Glu 


Val 


Ser 


Met 


Glu 


Glu 


Ser 










215 










220 










225 


Thr Gly 


Asp 


Arg Asp 


He 


Ser 


Asn 


Asn 


Gin 


He 


Leu 


Thr 


Thr 


Ser 










230 










235 










240 


Leu 


Arg 


Asp 


Leu 


Gin 


Glu 


Leu 


Glu 


Glu 


Leu 


His 


His 


Gin 


He 


Pro 










245 










250 










255 


Phe 


He 


Pro 


Ser 


Glu 


Asp 


Ser Trp 


Ala 


Val 


Pro 


Ser 


Glu 


Lys 


Asn 
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Ser Asn Lys 
Ser Lys Val 
Asp Ser Val 
Gin Ala Lys 
Val His Tyr 
Tyr Gin Met 
Gin Glu He 
Leu Leu Gly 
Asp Asn Asn 
Asn Leu Asn 
Met Gly Arg 
Gly Leu Arg 
Glu Lys Lys 
He Leu Trp 
Thr Val Tyr 
Lys Gin Val 
Glu Tyr Arg 
Lys His Val 
Asn Thr Val 
Ser Ser He 
Cys Lys Tyr 
Glu Asn Cys 
Leu Met Pro 
Arg Arg Leu 
Leu Lys Ser 
He Asn Glu 
Gly Cys Thr 
Ser Met Asp 



260 
Tyr Val Gin 

275 
Asn Ala Ser 

290 
Ser Asp His 

305 
Gin Glu Ser 

320 
Leu Asp His 

335 
Phe Gly Lys 

350 

Met Asp Ser 

365 
Cys Leu Ala 

380 
Ser Cys Gin 

395 
Leu Val Leu 

410 
Glu Thr Thr 

425 
He Tyr Asp 

440 
He Phe Ser 

455 
Thr Lys Gly 

470 
Cys Gly Leu 

485 
Ala Leu Asp 

500 
Lys Leu Gin 

515 
Asn He Val 

530 
Ser He Phe 

545 
He Asn Arg 

560 

Thr Lys Gin 

575 
Val Val His 

590 
Thr Gly He 

605 

Ala Trp Ala 
620 

Met His Gly 

635 
Ser Gly Tyr 

650 
Val Phe Glu 

665 
Arg Met Ala 



Gin Glu Lys 
Arg He Leu 
Ser Lys Thr 
Ala Ser Ser 
Asp Ser Leu 
Thr Leu Ser 
Val Asn Asn 
Ala Glu Leu 
Lys Met Ala 
Arg Trp Arg 
Lys Val Lys 
Arg Glu Glu 
Glu Asn Ser 
Glu He Leu 
Thr Ser Gin 
Thr Ser Asn 
Glu Glu Val 
Ala Tyr Leu 
Met Glu Phe 
Phe Gly Pro 
He Leu Gin 
Arg Asp He 
He Lys Leu 
Gly Leu Asn 
Thr Pro Tyr 
Gly Arg Lys 
Met Ala Thr 
Ala Met Phe 



265 

Gin Asn Thr 
280 

Thr Asn Asp 
295 

Leu Thr Asn 
310 

Gin Thr Tyr 
325 

Ala Asn Lys 
340 

Gly Thr Asn 

355 

Glu Glu Leu 
370 

Leu Ala Leu 
385 

Asn Glu Thr 

400 

Gly Ser Thr 
415 

He Gin Arg 
430 

Lys Phe Leu 
445 

Leu Lys Ser 
460 

Gly Lys Gly 

475 

Gly Gin Leu 
490 

Lys Leu Ala 
505 

Asp Leu Leu 
520 

Gly Thr Cys 
535 

Val Pro Gly 
550 

Leu Pro Glu 

565 

Gly Val Ala 
580 

Lys Gly Asn 
595 

He Asp Phe 

610 

Gly Thr His 
625 

Trp Met Ala 
640 

Ser Asp He 
655 

Gly Lys Pro 
670 

Tyr He Gly 



270 

Ala Ser Leu 
285 

Leu Glu Phe 

300 

Phe Ser Phe 
315 

Gin Tyr Trp 
330 

Ser He Thr 
345 

Ser He Ser 

360 

Thr Asp Glu 
375 

Asp Glu Lys 
390 

Asp Pro Glu 
405 

Pro Lys Glu 
420 

His Ser Ser 
435 

He Ser Asn 
450 

Glu Glu Pro 
465 

Ala Tyr Gly 

480 

He Ala Val 
495 

Ala Glu Lys 
510 

Lys Ala Leu 
525 

Leu Gin Glu 
540 

Gly Ser He 
555 

Met Val Phe 

570 

Tyr Leu His 
585 

Asn Val Met 
600 

Gly Cys Ala 

615 

Ser Asp Met 
630 

Pro Glu Val 
645 

Trp Ser He 

660 

Pro Leu Ala 
675 

Ala His Arg 
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680 685 690 

Gly Leu Met Pro Pro Leu Pro Asp His Phe Ser Glu Asn Ala Ala 

695 700 705 

Asp Phe Val Arg Met Cys Leu Thr Arg Asp Gin His Glu Arg Pro 

710 715 720 

Ser Ala Leu Gin Leu Leu Lys His Ser Phe Leu Glu Arg Ser His 

725 730 735 



<210> 7 

<211> 506 

<212> PRT 

<213> Hozno sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7483334CD1 

<400> 7 



Met 


Asp 


Asp 


Tyr Met 


Val 


Leu 


Arg Met He Gly Glu Gly Ser Phe 


1 








5 






10 15 


Gly Arg 


Ala 


Leu 


Leu 


Val 


Gin 


Leu Glu Ser Ser Asn Gin Met Phe 










20 






25 30 


Ala 


Met 


Lys 


Glu 


He 
35 


Arg 


Leu 


Pro Lys Ser Phe Ser Asn Thr Gin 
40 45 


Asn 


Ser 


Arg 


Lys 


Glu 
50 


Ala 


Val 


Leu Leu Ala Lys Met Lys His Pro 
55 60 


Asn 


He 


Val 


Ala 


Phe 
65 


Lys 


Glu 


Ser Phe Glu Ala Glu Gly His Leu 
70 75 


Tyr 


He 


Val 


Met 


Glu 


Tyr Cys 


Asp Gly Gly Asp Leu Met Gin Lys 










80 






85 90 


He 


Lys 


Gin 


Gin 


Lys 


Gly Lys 


Leu Phe Pro Glu Asp Met He Leu 










95 






100 105 


Asn Trp 


Phe 


Thr 


Gin 


Met 


Cys 


Leu Gly Val Asn His He His Lys 










110 






115 120 


Lys 


Arg 


Val 


Leu 


His 
125 


Arg 


Asp 


He Lys Ser Lys Asn He Phe Leu 
130 135 


Thr 


Gin 


Asn 


Gly 


Lys 
140 


Val 


Lys 


Leu Gly Asp Phe Gly Ser Ala Arg 
145 150 


Leu 


Leu 


Ser 


Asn 


Pro 

155 


Met 


Ala 


Phe Ala Cys Thr Tyr Val Gly Thr 
160 165 


Pro 


Tyr 


Tyr Val 


Pro 


Pro 


Glu 


He Trp Glu Asn Leu Pro Tyr Asn 










170 






175 180 


Asn 


Lys 


Ser 


Asp 


He 
185 


Trp 


Ser 


Leu Gly Cys He Leu Tyr Glu Leu 
190 195 


Cys 


Thr 


Leu 


Lys 


His 


Pro 


Phe 


Gin Ala Asn Ser Trp Lys Asn Leu 








200 






205 210 


He 


Leu 


Lys 


Val 


Cys Gin Gly 


Cys He Ser Pro Leu Pro Ser His 










215 






220 225 


Tyr 


Ser 


Tyr Glu Leu Gin Phe 


Leu Val Lys Gin Met Phe Lys Arg 










230 






235 240 


Asn 


Pro 


Ser 


His 


Arg 
245 


Pro 


Ser 


Ala Thr Thr Leu Leu Ser Arg Gly 
250 255 


He 


Val 


Ala 


Arg 


Leu 
260 


Val 


Gin 


Lys Cys Leu Pro Pro Glu He He 
265 270 


Met 


Glu 


Tyr Gly Glu Glu Val 


Leu Glu Glu He Lys Asn Ser Lys 
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His Asn Thr 
Ala Leu Oly 
Arg Lys Gly 
Val Glu Ser 
Lys Ser Val 
Arg Gin Trp 
Glu Asn Ala 
Arg Gly Gly 
Gin Trp Leu 
Asn Ala Asp 
Pro Gly Ser 
Glu Ala Ser 
Asp Pro Glu 
Phe Glu Glu 
Lys Arg Ala 



275 
Pro Arg Lys 

290 
Asn Glu Ala 

305 
Ser His Thr 

320 
Ala Leu Arg 

335 
His Leu Arg 

350 
Glu Lys Asn 

365 
Ser lie Leu 

380 
Ser Val lie 

395 
Lys Glu Thr 

410 
Leu Ser Leu 

425 
Glu Gly Phe 

440 
Asp Ser Val 

455 
Arg Leu Olu 

470 
Glu Asp Asp 

485 
Gly Trp Gin 

500 



Lys Thr Asn 
Ser Thr Val 
Asp Leu Glu 
Arg Val Asn 
Lys Ala Ser 
Val Pro Asn 
Thr Ser Ser 
Lys Tyr Ser 
Pro Asp Thr 
Ala Phe Gin 
Leu Lys Gly. 
Asp Gly Gly 
Pro Gly Leu 
Asn Pro Asp 
Gly Leu Cys 



280 

Pro Ser Arg 
295 

Gin Glu Glu 
310 

Ser lie Asn 
325 

Arg Glu Glu 
340 

Ser Pro Asn 

355 

Thr Ala Leu 
370 

Leu Thr Ala 
385 

Lys Asn Thr 
400 

Leu Leu Asn 
415 

Thr Tyr Thr 
430 

Pro Leu Ser 

445 

His Asp Ser 
460 

Asp Glu Glu 
475 

Trp Val Ser 
490 

Asp Arg 
505 



285 

lie Arg lie 
300 

Glu Gin Asp 
315 

Glu Asn Leu 
330 

Lys Gly Asn 
345 

Leu His Arg 
360 

Thr Ala Leu 
375 

Glu Asp Asp 
390 

Thr Arg Lys 
405 

lie Leu Lys 
420 

lie Tyr Arg 
435 

Glu Glu Thr 
450 

Val He Leu 
465 

Asp Thr Asp 
480 

Glu Leu Lys 
495 



<210> 8 
<211> 2014 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc^f eature 

<223> Incyte ID No: 7483337CD1 



<400> 8 

Met Glu Thr Leu Asn 
1 5 
Thr Arg Gly Gly Asp 

20 

He Arg Ala Ala Tyr 

35 

Leu Leu Glu Olu Ala 

50 

Gly Gin Leu Leu Ser 

65 

Val Arg Tyr Leu Leu 

80 

Pro Thr Asp Asp Asn 

95 



Gly Ala Gly Asp Thr 

10 

Pro Ala Ala Arg Ser 

25 

Arg Arg Gly Asp Arg 

40 

Cys Asp Gin Cys Ala 

55 

He Pro Ala Ala Tyr 

70 

Ser Lys Arg Leu Val 

85 

Pro Ala Val Val Ala 

100 



Gly Gly Lys Pro Ser 

15 

Arg Arg Thr Glu Gly 

30 

Gly Gly Ala Arg Asp 

45 

Ser Oln Leu Glu Lys 

60 

Gly Asp Leu Glu Met 

75 

Glu Leu Pro Thr Glu 

90 

Ala Tyr Phe Gly His 

105 
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Thr Ala Val Val Gin Asn Thr Leu Pro Thr Glu Pro Thr Asp Asp 

110 115 120 

Asn Pro Ala Val Val Ala Ala Tyr Phe Gly His Thr Ala Val Val 

125 130 135 

Gin Glu Leu Leu Glu Ser Leu Pro Gly Pro Cys Ser Pro Gin Arg 

140 145 150 

Leu Leu Asn Trp Met Leu Ala Leu Ala Cys Gin Arg Gly His Leu 

155 160 165 

Gly Val Val Lys Leu Leu Val Leu Thr His Gly Ala Asp Pro Glu 

170 175 180 

Ser Tyr Ala Val Arg Lys Asn Glu Phe Pro Val lie Val Arg Leu 

185 190 195 

Pro Leu Tyr Ala Ala lie Lys Ser Gly Asn Glu Asp lie Ala lie 

200 205 210 

Phe Leu Leu Arg His Gly Ala Tyr Phe Cys Ser Tyr He Leu Leu 

215 220 225 

Asp Ser Pro Asp Pro Ser Lys His Leu Leu Arg Lys Tyr Phe He 

230 235 240 

Glu Ala Ser Pro Leu Pro Ser Ser Tyr Pro Gly Lys Thr Ala Leu 

245 250 255 

Arg Val Lys Trp Ser His Leu Arg Leu Pro Trp Val Asp Leu Asp 

260 265 270 

Trp Leu He Asp He Ser Cys Gin He Thr Glu Leu Asp Leu Ser 

275 280 285 

Ala Asn Cys Leu Ala Thr Leu Pro Ser Val He Pro Trp Gly Leu 

290 295 300 

He Asn Leu Arg Lys Leu Asn Leu Ser Asp Asn His Leu Gly Glu 

305 310 315 

Leu Pro Gly Val Gin Ser Ser Asp Glu He He Cys Ser Arg Leu 

320 325 330 

Leu Glu He Asp He Ser Ser Asn Lys Leu Ser His Leu Pro Pro 

335 340 345 

Gly Phe Leu His Leu Ser Lys Leu Gin Lys Leu Thr Ala Ser Lys 

350 355 360 

Asn Cys Leu Glu Lys Leu Phe Glu Glu Glu Asn Ala Thr Asn Trp 

365 370 375 

He Gly Leu Arg Lys Leu Gin Glu Leu Asp He Ser Asp Asn Lys 

380 385 390 

Leu Thr Glu Leu Pro Ala Leu Phe Leu His Ser Phe Lys Ser Leu 

395 400 405 

Asn Ser Leu Asn Val Ser Arg Asn Asn Leu Lys Val Phe Pro Asp 

410 415 420 

Pro Trp Ala Cys Pro Leu Lys Cys Cys Lys Ala Ser Arg Asn Ala 

425 430 435 

Leu Glu Cys Leu Pro Asp Lys Met Ala Val Phe Trp Lys Asn His 

440 445 450 

Leu Lys Asp Val Asp Phe Ser Glu Asn Ala Leu Lys Glu Val Pro 

455 460 465 

Leu Gly Leu Phe Gin Leu Asp Ala Leu Met Phe Leu Arg Leu Gin 

470 475 480 

Gly Asn Gin Leu Ala Ala Leu Pro Pro Gin Glu Lys Trp Thr Cys 

485 490 495 

Arg Gin Leu Lys Thr Leu Asp Leu Ser Arg Asn Gin Leu Gly Lys 

500 505 510 

Asn Glu Asp Gly Leu Lys Thr Lys Arg He Ala Phe Phe Thr Thr 



515 



520 



525 
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Arg Gly 
Phe Ser 
Asn Leu 
Ala Phe 
His Leu 
Ser Glu 
Pro Glu 
Asp Leu 
Pro Lys 
Glu Lys 
Gin Gly 
Pro Gin 
Glu Leu 
Leu Arg 
Trp Asp 
Phe Phe 
Leu Gly 
He Glu 
His Leu 
Thr Leu 
Ser Arg 
Glu He 
Leu He 
He Gly 
Leu Ser 
Arg Asp 
Leu Val 
Leu Gin 



Arg Gin Arg Ser Gly Thr Glu Ala Glu Thr Thr Met Glu 

530 535 540 

Ala Ser Leu Val Thr He Val Phe Leu Ser Asn Asn Cys 

545 550 555 

Cys Ala Tyr Thr Cys Ala Ala Ser Val Leu Glu Phe Pro 

560 565 570 

Leu Ser Glu Ser Leu Glu Val Leu Cys Leu Asn Asp Asn 

575 580 585 

Asp Thr Val Pro Pro Ser Val Cys Leu Leu Lys Ser Leu 

590 595 600 

Leu Tyr Leu Gly Asn Asn Pro Gly Leu Arg Glu Leu Pro 

605 610 615 

Leu Gly Gin Leu Gly Asn Leu Trp Gin Leu Asp Thr Glu 

620 625 630 

Thr He Ser Asn Val Pro Ala Glu He Gin Lys Glu Gly 

635 640 645 

Ala Met Leu Ser Tyr Leu Arg Ala Gin Leu Arg Lys Ala 

650 655 660 

Cys Lys Leu Met Lys Met He He Val Gly Pro Pro Arg 

665 670 675 

Lys Ser Thr Leu Leu Glu He Leu Gin Thr Gly Arg Ala 

680 685 690 

Val Val His Gly Glu Ala Thr He Arg Thr Thr Lys Trp 

695 700 705 

Gin Arg Pro Ala Gly Ser Arg Ala Lys Val Lys Asp Gly 

710 715 720 

Ala Glu Ser Leu Trp Val Glu Ser Val Glu Phe Asn Val 

725 730 735 

He Gly Gly Pro Ala Ser Met Ala Thr Val Asn Gin Cys 

740 745 750 

Thr Asp Lys Ala Leu Tyr Val Val . Val Trp Asn Leu Ala 

755 760 765 

Glu Glu Ala Val Ala Asn Leu Gin Phe Trp Leu Leu Asn 

770 775 780 

Ala Lys Ala Pro Asn Ala Val Val Leu Val Val Gly Thr 

785 790 795 

Asp Leu He Glu Ala Lys Phe Arg Val Glu Arg He Ala 

800 805 810 

Arg Ala Tyr Val Leu Ala Leu Cys Arg Ser Pro Ser Gly 

815 820 825 

Ala Thr Gly Phe Pro Asp He Thr Phe Lys His Leu His 

830 835 840 

Ser Cys Lys Ser Leu Glu Gly Gin Glu Gly Leu Arg Oln 

845 850 855 

Phe His Val Thr Cys Ser Met Lys Asp Val Gly Ser Thr 

860 865 870 

Cys Gin Arg Leu Ala Gly Arg Leu He Pro Arg Ser Tyr 

875 880 885 

Leu Gin Glu Ala Val Leu Ala Glu Gin Gin Arg Arg Ser 

890 895 900 

Asp Asp Val Gin Tyr Leu Thr Asp Arg Gin Leu Glu Gin 

905 910 915 

Glu Gin Thr Pro Asp Asn Asp He Lys Asp Tyr Glu Asp 

920 925 930 

Ser Ala He Ser Phe Leu He Glu Thr Gly Thr Leu Leu 



935 



940 



945 
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His Phe Pro Asp Thr 

950 

Asp Pro lie Trp Leu 

965 

Lys Gly Ser Arg Ser 

980 

Asp Leu Arg Met Leu 

995 

Qlu Qlu Gin Tyr Phe 

1010 

Pro Val Ala Asn Asp 

1025 

Lys Pro Gly Leu Asp 

1040 

Thr He Qln Arg Val 

1055 

Trp Gin Arg Phe He 

1070 

Asp Leu Gin Leu Phe 

1085 

Arg Lys Val Thr He 

1100 

Cys Ser Thr Phe Arg 

1115 

Glu Gly Leu Leu Val 

1130 

Ser Ser Asp Val Asn 

1145 

He Val Cys Gin Ser 

1160 

He Thr Asp His Val 

1175 

Leu Thr Ala Thr Glu 

1190 

Val Pro Cys Pro Val 

1205 

Pro Ser Glu Lys Ser 

1220 

Cys Val Leu Thr Ala 

1235 

His Pro Asp Leu Pro 

1250 

Phe Met Thr Asp Phe 

1265 

Leu Glu His Ser Glu 

1280 

Ser Gly Thr Val He 

1295 

Ala Val Lys Arg Phe 

1310 

Val* Pro Ala Asp Thr 

1325 

Met Lys Asn Phe Ser 

1340 

Ala Leu Gin His Pro 

1355 



Ser His Gly Leu Arg 

955 

Ser Glu Cys Leu Gin 

970 

Val Ala Lys Asn Gly 

985 

Leu Val Gly Thr Gly 

1000 

Gin Phe Leu Ala Lys 

1015 

Ser Tyr Leu Leu Pro 

1030 

Thr His Gly Met Arg 

1045 

Phe Lys Met Ser Phe 

1060 

Ala Arg Met Leu He 

1075 

Glu Asn Lys Lys Asn 

1090 

Tyr Ser Phe Thr Gly 

1105 

Val Lys Arg Asn Gin 

1120 

Thr Phe Asp Gly Gly 

1135 

Trp Lys Lys Lys Lys 

1150 

Glu Val Arg Asp Phe 

1165 

Asn Ser Leu He Asp 

1180 

Ser Asp Gly Thr Pro 

1195 

Cys Glu Thr Ala Trp 

1210 

Glu Asp Val Gin Tyr 

1225 

He Glu Arg Asp Phe 

1240 

Val Pro Leu Gin Glu 

1255 

Pro Ala Arg Leu Phe 

1270 

Asp Glu Gly Ser Val 

1285 

Tyr Arg Ala Arg Tyr 

1300 

His He Lys Lys Phe 

1315 

Met Leu Arg His Leu 

1330 

Glu Phe Arg Gin Glu 

1345 

Cys He Val Ala Leu 

1360 



Asn Leu Tyr Phe Leu 

960 

Arg He Phe Asn He 

975 

Val He Arg Ala Glu 

990 

Phe Thr Gin Gin Thr 

1005 

Phe Glu He Ala Leu 

1020 

His Leu Leu Pro Ser 

1035 

His Pro Thr Ala Asn 

1050 

Val Pro Val Gly Phe 

1065 

Ser Leu Ala Glu Met 

1080 

Thr Lys Ser Arg Asn 

1095 

Asn Gin Arg Asn Arg 

1110 

Thr He Tyr Trp Gin 

1125 

Tyr Leu Ser Val Glu 

1140 

Ser Gly Gly Met Lys 

1155 

Ser Ala Met Ala Phe 

1170 

Gin Trp Phe Pro Ala 

1185 

Leu Met Glu Gin Tyr 

1200 

Ala Gin His Thr Asp 

1215 

Phe Asp Met Glu Asp 

1230 

He Ser Cys Pro Arg 

1245 

Leu Val Pro Glu Leu 

1260 

Leu Glu Asn Ser Lys 

1275 

Leu Gly Gin Gly Gly 

1290 

Gin Gly Gin Pro Val 

1305 

Lys Asn Phe Ala Asn 

1320 

Arg Ala Thr Asp Ala 

1335 

Ala Ser Met Leu His 

1350 

He Gly He Ser He 

1365 
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His Pro 
Asn Thr 
Leu Gly 
Gly Leu 
Lys Ser 
lie Asn 
His Glu 
Pro Glu 
Phe Ser 
Pro Ala 
Lys Gly 
Arg Arg 
Glu Lys 
Pro Thr 
Thr Ala 
Trp Asp 
Glu Lys 
Lys Val 
Thr Glu 
Val Ala 
Ser Tyr 
Ala Asp 
Glu Val 
Gly Leu 
Leu Glu 
Ser Ser 
Lys Ala 
Cys Ala 



Leu Cys Phe Ala Leu Glu Leu Ala Pro Leu Ser Ser Leu 

1370 1375 1380 

Val Leu Ser Glu Asn Ala Arg Asp Ser Ser Phe lie Pro 

1385 1390 1395 

His Met Leu Thr Gin Lys lie Ala Tyr Gin lie Ala Ser 

1400 1405 1410 

Ala Tyr Leu His Lys Lys Asn lie lie Phe Cys Asp Leu 

1415 1420 1425 

Asp Asn lie Leu Val Trp Ser Leu Asp Val Lys Glu His 

1430 1435 1440 

lie Lys Leu Ser Asp Tyr Gly lie Ser Arg Gin Ser Phe 

1445 1450 1455 

Gly Ala Leu Gly Val Glu Gly Thr Pro Gly Tyr Gin Ala 

1460 1465 1470 

lie Arg Pro Arg lie Val Tyr Asp Glu Lys Val Asp Met 

1475 1480 1485 

Tyr Gly Met Val Leu Tyr Glu Leu Leu Ser Gly Gin Arg 

1490 1495 1500 

Leu Gly His His Gin Leu Gin lie Ala Lys Lys Leu Ser 

1505 1510 1515 

lie Arg Pro Val Leu Gly Gin Pro Glu Glu Val Gin Phe 

1520 1525 1530 

Leu Gin Ala Leu Met Met Glu Cys Trp Asp Thr Lys Pro 

1535 1540 1545 

Arg Pro Leu Ala Leu Ser Val Val Ser Gin Met Lys Asp 

1550 1555 1560 

Phe Ala Thr Phe Met Tyr Glu Leu Cys Cys Gly Lys Gin 

1565 1570 1575 

Phe Phe Ser Ser Gin Gly Gin Glu Tyr Thr Val Val Phe 

1580 1585 1590 

Gly Lys Qlu Glu Ser Arg Asn Tyr Thr Val Val Asn Thr 

1595 1600 1605 

Gly Leu Met Glu Val Gin Arg Met Cys Cys Pro Gly Met 

1610 1615 1620 

Ser Cys Gin Leu Gin Val Gin Arg Ser Leu Trp Thr Ala 

1625 1630 1635 

Asn Ser Tyr Leu Val Leu Ala Gly Leu Ala Asp Gly Leu 

1640 1645 1650 

Val Phe Pro Val Val Arg Gly Thr Pro Lys Asp Ser Cys 

1655 1660 1665 

Leu Cys Ser His Thr Ala Asn Arg Ser Lys Phe Ser lie 

1670 1675 1680 

Glu Asp Ala Arg Gin Asn Pro Tyr Pro Val Lys Ala Met 

1685 1690 1695 

Val Asn Ser Gly Ser Glu Val Trp Tyr Ser Asn Gly Pro 

1700 1705 1710 

Leu Val lie Asp Cys Ala Ser Leu Glu lie Cys Arg Arg 

1715 1720 1725 

Pro Tyr Met Ala Pro Ser Met Val Thr Ser Val Val Cys 

1730 1735 1740 

Glu Gly Arg Gly Glu Glu Val Val Trp Cys Leu Asp Asp 

1745 1750 1755 

Asn Ser Leu Val Met Tyr His Ser Thr Thr Tyr Gin Leu 

1760 1765 1770 

Arg Tyr Phe Cys Gly Val Pro Ser Pro Leu Arg Asp Met 



1775 



1780 



1785 
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Phe Pro Val Arg Pro Leu Asp Thr Glu Pro Pro Ala Ala Ser His 

1790 1795 1800 

Thr Ala Asn Pro Lys Val Pro Glu Gly Asp Ser He Ala Asp Val 

1805 1810 1815 

Ser He Met Tyr Ser Glu Glu Leu Gly Thr Gin He Leu He His 

1820 1825 1830 

Gin Glu Ser Leu Thr Asp Tyr Cys Ser Met Ser Ser Tyr Ser Ser 

1835 1840 1845 

Ser Pro Pro Arg Gin Ala Ala Arg Ser Pro Ser Ser Leu Pro Ser 

1850 1855 I860 

Ser Pro Ala Ser Ser Ser Ser Val Pro Phe Ser Thr Asp Cys Glu 

1865 1870 1875 

Asp Ser Asp Met Leu His Thr Pro Gly Ala Ala Ser Asp Arg Ser 

1880 1885 1890 

Glu His Asp Leu Thr Pro Met Asp Gly Glu Thr Phe Ser Gin His 

1895 1900 1905 

Leu Gin Ala Val Lys He Leu Ala Val Arg Asp Leu He Trp Val 

1910 1915 1920 

Pro Arg Arg Gly Gly Asp Val He Val He Gly Leu Glu Lys Asp 

1925 1930 1935 

Ser Gly Ala Gin Arg Gly Arg Val He Ala Val Leu Lys Ala Arg 

1940 1945 1950 

Glu Leu Thr Pro His Gly Val Leu Val Asp Ala Ala Val Val Ala 

1955 I960 1965 

Lys Asp Thr Val Val Cys Thr Phe Glu Asn Glu Asn Thr Glu Trp 

1970 1975 1980 

Cys Leu Ala Val Trp Arg Gly Trp Gly Ala Arg Glu Phe Asp He 

1985 1990 1995 

Phe Tyr Gin Ser Tyr Glu Glu Leu Gly Arg Leu Glu Ala Cys Thr 

2000 2005 2010 

Arg Lys Arg Arg 



<210> 9 

<211> 348 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> inisc_f eature 

<223> Incyte ID No: 6035509CD1 



<400> 9 
















Met 


Met 


Leu Gly 


Leu 


Glu 


Ser 


Leu Pro 


Asp Pro Thr Asp Thr 


Trp 


1 






5 








10 


15 


Glu 


He 


He Olu 


Thr 


He 


Gly 


Lys Gly 


Thr Tyr Gly Lys Val 


Tyr 








20 








25 


30 


Lys 


Val 


Thr Asn 


Lys 


Arg 


Asp 


Gly Ser 


Leu Ala Ala Val Lys 


He 






35 








40 


45 


Leu 


Asp 


Pro Val 


Ser Asp 


Met 


Asp Glu 


Glu He Glu Ala Glu 


Tyr 








50 








55 


60 


Asn 


He 


Leu Gin 


Phe 


Leu 


Pro 


Asn His 


Pro Asn Val Val Lys 


Phe 








65 








70 


75 


Tyr 


Gly 


Met Phe 


Tyr 


Lys 


Ala 


Asp His 


Cys Val Gly Gly Gin 


Leu 








80 








85 


90 


Trp 


Leu 


Val Leu 


Glu 


Leu 


Cys 


Asn Gly 


Gly Ser Val Thr Glu 


Leu 
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Val Lys Gly 
lie Ser Tyr 
His Asn Asn 
Leu Leu Thr 
Ser Ala Qln 
Gly Thr Pro 
Gin Tyr Asp 
Gly He Thr 
Asp Met His 
Pro Pro Thr 
His Phe He 
Ser Val Thr 
Gly Lys Val 
Asp Gin Lys 
Met His Thr 
Cys Leu Glu 
Leu Asn He 




95 

Leu Leu Arg 

110 
He Leu Tyr 

125 
Arg He He 

140 
Thr Glu Gly 

155 
Leu Thr Ser 

170 
Phe Trp Met 

185 
Ser Ser Tyr 

200 
Ala He Glu 

215 
Pro Val Lys 

230 
Leu Leu His 

245 
Ser Gin Cys 

260 
His Leu Leu 

275 
Leu Phe Leu 

290 
His Gin Asn 

305 
Arg Arg Pro 

320 
Asp Asp Leu 

335 



Cys Gly Gin 
Gly Ala Leu 
His Arg Asp 
Gly Val Lys 
Thr Arg Leu 
Ala Pro Glu 
Asp Ala Arg 
Leu Gly Asp 
Thr Leu Phe 
Pro Glu Lys 
Leu He Lys 
Asp His Pro 
Gin Lys Qln 
Pro Val Ala 
Tyr His Val 
Val Asn Leu 



100 

Arg Leu Asp 
115 

Leu Gly Leu 

130 

Val Lys Gly 
145 

Leu Val Asp 
160 

Arg Arg Asn 

175 

Val He Ala 
190 

Cys Asp Val 
205 

Gly Asp Pro 
220 

Lys He Pro 
235 

Trp Cys Glu 
250 

Asp Phe Glu 
265 

Phe He Lys 
280 

Leu Ala Lys 
295 

Lys Thr Arg 

310 

Glu Asp Ala 
325 

Glu Val Leu 
340 
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105 

Glu Ala Met 
120 

Gin His Leu 

135 

Asn Asn He 
150 

Phe Gly Val 
165 

Thr Ser Val 

180 

Cys Glu Gin 
195 

Trp Ser Leu 
210 

Pro Leu Phe 
225 

Arg Asn Pro 
240 

Glu Phe Asn 
255 

Arg Arg Pro 

270 

Gly Val His 
285 

Val Leu Gin 
300 

His Glu Arg 
315 

Glu Lys Tyr 
330 

Asp Glu Val 
345 



<210> 10 
<211> 2042 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7373485CD1 



<400> 10 
Met Ala Thr 
1 

Asp Leu Pro 
Lys Pro Leu 
Tyr Ser Ser 
Glu Gly Gly 



Asp Asp Lys 
5 

Arg Ser Pro 
20 

Thr Pro Asp 
35 

Phe Val Asn 

50 

Gin Gly Glu 
65 



Thr Ser Pro 
Thr Ser Pro 
Gin Asp Glu 
Leu Phe Arg 
Gin Gin Pro 



Thr Leu Asp 
10 

Ser His Leu 
25 

Pro Pro Phe 
40 

Phe Asn Lys 

55 

Leu Ser Gly 
70 



Ser Ala Asn 

15 

Thr His Phe 
30 

Lys Ser Ala 
45 

Glu Arg Ala 

60 

Ser Trp Thr 



18/75 



wo 02/33099 




Ser Pro 


Gin 


Leu Pro Ser 






80 


Pro Tyr 


Lys 


Lys Gin Leu 






95 


Ala Leu Asp Thr Arg Arg 






110 


Asp Pro 


Arg 


Thr Ala Val 






125 


Lys Arg 


Leu Lys Glu He 






140 


Leu Lys 


Gin Tyr Trp Met 






155 


Asp Cys 


Ser 


Glu Lys Phe 






170 


Arg Leu Cys Gly Gin lie 






185 


lie Pro 


Gly Lys Phe Met 






200 


Thr Tyr 


Cys 


Arg Lys He 






215 


Ser Asn 


Ser He Gly Glu 






230 


Cys Ser 


Val 


Ser Val Leu 






245 


Gly Ser 


Arg 


Lys Ala Ser 






260 


Ala Trp 


Gin 


Ser Leu He 






275 


Ser Thr 


Arg 


Leu Val Ser 






290 


Ala Arg 


Asn 


Arg Ser Ala 






305 


Ser Gly Ser 


Pro Met Val 






320 


Gin Ala Asn Arg Thr Tyr 






335 


Arg Lys 


He 


Leu Leu Asp 






350 


Lys lie 


Cys 


His His Ser 






365 


Tyr Trp 


Leu 


Arg Thr His 






380 


Val Asn 


Trp 


Leu He Arg 






395 


Ala He 


Ala 


He Gly Gin 






410 


Cys Val 


Ser 


His His Asp 






425 


Tyr Arg 


Pro 


Leu Gin Ser 






440 


Asp Ser 


Asp 


Ser Val Asn 






455 


Trp Phe 


Lys 


Asp He Lys 






470 


Ala Glu Glu Gly Asp Asp 






485 



Arg Thr Gin Ser Val Arg 

85 

Asn Glu Glu Leu Gin Arg 

100 

Lys Ala Glu Pro Thr Phe 

115 

Gin Leu Arg Ser Leu Ser 

130 

Met Glu Gly Lys Ser Gin 

145 

Pro Asp Ser Gin Cys Lys 

160 

Thr Thr Phe Arg Arg Arg 

175 

Phe Cys Ser Arg Cys Cys 

190 

Gly Tyr Thr Gly Asp Leu 

205 

Ala Leu Ser Tyr Ala His 

220 

Asp Leu Asn Ala Leu Ser 

235 

Asp Pro Ser Glu Pro Arg 

250 

Arg Asn He Phe Leu Glu 

265 

His Pro Asp Ser Ser Asn 

280 

Val Gin Glu Asp Ala Gly 

295 

Ser He Thr Asn Leu Ser 

310 

Pro Ser Tyr Glu Thr Ser 

325 

Val Arg Thr Glu Thr Thr 

340 

Ser Val Gin Leu Lys Asp 

355 

Ser Gly Met Glu Phe Gin 

370 

Pro Asn Cys He Val Gly 

385 

Asn Gly His He Ala Thr 

400 

Ala Met Val Asp Gly Arg 

415 

Gin Leu Phe Arg Asp Glu 

430 

Thr Glu Phe Ser Glu Thr 

445 

Ser Val Glu Gly His Ser 

460 

Phe Asp Asp Ser Asp Thr 

475 

Asn Leu Ala Lys Tyr Leu 

490 
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Ser Pro Thr 
90 

Arg Ser Ser 

105 

Gly Gly His 
120 

Thr Val Leu 
135 

Asp Ser Asp 
150 

Glu Cys Tyr 
165 

His His Cys 
180 

Asn Gin Glu 
195 

Arg Ala Cys 
210 

Ser Thr Asp 
225 

Asp Ser Ala 
240 

Thr Pro Val 
255 

Asp Asp Leu 

270 

Thr Pro Leu 
285 

Lys Ser Pro 
300 

Leu Asp Arg 

315 

Val Ser Pro 
330 

Glu Asp Glu 
345 

Leu Trp Lys 
360 

Asp His Arg 
375 

Lys Glu Leu 

390 

Arg Ala Gin 
405 

Trp Leu Asp 
420 

Tyr Ala Leu 
435 

Pro Ser Pro 
450 

Glu Pro Ser 
465 

Glu Gin He 
480 

He Ser Asp 
495 
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Thr Gly 
Ser Leu 
Phe Thr 
Glu Glu 
Asn His 
Asp Ser 
Cys Gin 
Asp Asp 
Gly Gly 
Thr Lys 
Pro Lys 
Glu Glu 
Arg Glu 
Pro Thr 
Asp Met 
Ser Gin 
Val Met 
Cys His 
Thr Lys 
Cys Thr 
Val Lys 
Gin Leu 
Thr Leu 
Gly His 
Pro Trp 
Ser Ser 
Glu Asn 
Glu His 



Gly Gin Gin Leu Ser He Ser Asp Ala Phe He Lys Glu 

500 505 510 

Phe Asn Arg Arg Val Glu Glu Lys Ser Lys Glu Leu Pro 

515 520 525 

Pro Leu Gly Trp His His Asn Asn Leu Glu Leu Leu Arg 

530 535 540 

Asn Gly Glu Lys Gin Ala Met Glu Arg Leu Leu Ser Ala 

545 550 555 

Asn His Met Met Ala Leu Leu Gin Gin Leu Leu His Ser 

560 565 570 

Leu Ser Ser Ser Trp Arg Asp He He Val Ser Leu Val 

575 580 585 

Val Val Gin Thr Val Arg Pro Asp Val Lys Asn Gin Asp 

590 595 600 

Met Asp He Arg Gin Phe Val His He Lys Lys He Pro 

605 610 615 

Lys Lys Phe Asp Ser Val Val Val Asn Gly Phe Val Cys 

620 625 630 

Asn He Ala His Lys Lys Met Asn Ser Cys He Lys Asn 

635 640 645 

He Leu Leu Leu Lys Cys Ser He Glu Tyr Leu Tyr Arg 

650 • 655 660 

Thr Lys Phe Thr Cys He Asp Pro He Val Leu Gin Glu 

665 670 675 

Phe Leu Lys Asn Tyr Val Gin Arg He Val Asp Val Arg 

680 685 690 

Leu Val Leu Val Glu Lys Thr Val Ser Arg He Ala Gin 

695 700 705 

Leu Leu Glu His Gly He Thr Leu Val He Asn Val Lys 

710 715 720 

Val Leu Glu Arg He Ser Arg Met Thr Gin Gly Asp Leu 

725 730 735 

Ser Met Asp Gin Leu Leu Thr Lys Pro Arg Leu Gly Thr 

740 745 750 

Lys Phe Tyr Met Gin He Phe Gin Leu Pro Asn Glu Gin 

755 760 765 

Thr Leu Met Phe Phe Glu Gly Cys Pro Gin His Leu Gly 

770 775 780 

He Lys Leu Arg Gly Gly Ser Asp Tyr Glu Leu Ala Arg 

785 790 795 

Glu He Leu He Phe Met He Cys Val Ala Tyr His Ser 

800 805 810 

Glu He Ser Phe Leu Met Asp Glu Phe Ala Met Pro Pro 

815 820 825 

Met Gin Asn Pro Ser Phe His Ser Leu He Glu Gly Arg 

830 835 840 

Glu Gly Ala Val Gin Glu Gin Tyr Gly Gly Gly Ser He 

845 850 855 

Asp Pro Asp He Pro Pro Glu Ser Leu Pro Cys Asp Asp 

860 865 870 

Leu Leu Glu Ser Arg He Val Phe Glu Lys Gly Glu Gin 

875 880 885 

Lys Asn Leu Pro Gin Ala Val Ala Ser Val Lys His Gin 

890 895 900 

Ser Thr Thr Ala Cys Pro Ala Gly Leu Pro Cys Ala Phe 



905 



910 



915 
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Phe Ala Pro Val Pro Glu Ser Leu Leu Pro Leu Pro Val Asp Asp 

920 925 930 

Gin Gin Asp Ala Leu Gly Ser Glu Leu Pro Glu Ser Leu Gin Gin 

935 940 945 

Thr Val Val Leu Gin Asp Pro Lys Ser Gin He Arg Ala Phe Arg 

950 955 960 

Asp Pro Leu Gin Asp Asp Thr Gly Leu Tyr Val Thr Glu Glu Val 

965 970 975 

Thr Ser Ser Glu Asp Lys Arg Lys Thr Tyr Ser Leu Ala Phe Lys 

980 985 990 

Gin Glu Leu Lys Asp Val He Leu Cys He Ser Pro Val He Thr 

995 1000 1005 

Phe Arg Glu Pro Phe Leu Leu Thr Glu Lys Gly Met Arg Cys Ser 

1010 1015 1020 

Thr Arg Asp Tyr Phe Ala Glu Gin Val Tyr Trp Ser Pro Leu Leu 

1025 1030 1035 

Asn Lys Qlu Phe Lys Glu Met Glu Asn Arg Arg Lys Lys Gin Leu 

1040 1045 1050 

Leu Arg Asp Leu Ser Gly Leu Gin Gly Met Asn Gly Ser He Gin 

1055 1060 1065 

Ala Lys Ser He Gin Val Leu Pro Ser His Glu Leu Val Ser Thr 

1070 1075 1080 

Arg He Ala Glu His Leu Gly Asp Ser Gin Ser Leu Gly Arg Met 

1085 1090 1095 

Leu Ala Asp Tyr Arg Ala Arg Gly Gly Arg He Gin Pro Lys Asn 

1100 1105 1110 

Ser Asp Pro Phe Ala His Ser Lys Asp Ala Ser Ser Thr Ser Ser 

1115 1120 1125 

Gly Lys Ser Gly Ser Lys Asn Glu Gly Asp Glu Glu Arg Gly Leu 

1130 1135 1140 

He Leu Ser Asp Ala Val Trp Ser Thr Lys Val Asp Cys Leu Asn 

1145 1150 1155 

Pro He Asn His Gin Arg Leu Cys Val Leu Phe Ser Ser Ser Ser 

1160 1165 1170 

Ala Gin Ser Ser Asn Ala Pro Ser Ala Cys Val Ser Pro Trp He 

1175 1180 1185 

Val Thr Met Glu Phe Tyr Gly Lys Asn Asp Leu Thr Leu Gly He 

1190 1195 1200 

Phe Leu Glu Arg Tyr Cys Phe Arg Pro Ser Tyr Gin Cys Pro Ser 

1205 1210 1215 

Met Phe Cys Asp Thr Pro Met Val His His He Arg Arg Phe Val 

1220 1225 1230 

His Gly Gin Gly Cys Val Gin He He Leu Lys Glu Leu Asp Ser 

1235 1240 1245 

Pro Val Pro Gly Tyr Gin His Thr He Leu Thr Tyr Ser Trp Cys 

1250 1255 1260 

Arg He Cys Lys Gin Val Thr Pro Val Val Ala Leu Ser Asn Glu 

1265 1270 1275 

Ser Trp Ser Met Ser Phe Ala Lys Tyr Leu Glu Leu Arg Phe Tyr 

1280 1285 1290 

Gly His Gin Tyr Thr Arg Arg Ala Asn Ala Glu Pro Cys Gly His 

1295 1300 1305 

Ser He His His Asp Tyr. His Gin Tyr Phe Ser Tyr Asn Gin Met 

1310 1315 1320 

Val Ala Ser Phe Ser Tyr Ser Pro He Arg Leu Leu Glu Val Cys 

1325 1330 1335 
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Val Pro 
Val Ser 
Gin Val 
Asp Thr 
Ala Gin 
Lys Met 
Gin Leu 
Leu Cys 
Phe Gin 
Pro Gly 
Asp Ala 
Lys Glu 
Ser Ser 
Qlu Gin 
Ser Glu 
Gin Val 
Leu Pro 
Asp Lys 
Val Cys 
Cys Lys 
Gin Trp 
Asp Ser 
Ser Gly 
Thr Lys 
Gly Lys 
Gly Ala 
Gly Thr 
Gly Asp 



Leu Pro Lys lie Phe lie Lys Arg Gin Ala Pro Leu Lys 

1340 1345 1350 

Leu Leu Gin Asp Leu Lys Asp Phe Phe Gin Lys Val Ser 

1355 1360 1365 

Tyr Val Ala lie Asp Glu Arg Leu Ala Ser Leu Lys Thr 

1370 1375 1380 

Phe Ser Lys Thr Arg Glu Glu Lys Met Glu Asp lie Phe 

1385 1390 1395 

Lys Glu Met Glu Glu Gly Glu Phe Lys Asn Trp lie Glu 

1400 1405 1410 

Gin Ala Arg Leu Met Ser Ser Ser Val Asp Thr Pro Gin 

1415 1420 1425 

Gin Ser Val Phe Glu Ser Leu He Ala Lys Lys Gin Ser 

1430 1435 1440 

Glu Val Leu Gin Ala Trp Asn Asn Arg Leu Gin Asp Leu 

1445 1450 1455 

Gin Glu Lys Gly Arg Lys Arg Pro Ser Val Pro Pro Ser 

1460 1465 1470 

Arg Leu Arg Gin Gly Glu Glu Ser Lys He Ser Ala Met 

1475 1480 1485 

Ser Pro Arg Asn He Ser Pro Gly Leu Gin Asn Gly Glu 

1490 1495 1500 

Asp Arg Phe Leu Thr Thr Leu Ser Ser Qln Ser Ser Thr 

1505 1510 1515 

Thr His Leu Gin Leu Pro Thr Pro Pro Glu Val Met Ser 

1520 1525 1530 

Ser Val Gly Gly Pro Pro Glu Leu Asp Thr Ala Ser Ser 

1535 1540 1545 

Asp Val Phe Asp Gly His Leu Leu Gly Ser Thr Asp Ser 

1550 1555 1560 

Lys Glu Lys Ser Thr Met Lys Ala He Phe Ala Asn Leu 

1565 1570 1575 

Gly Asn Ser Tyr Asn Pro He Pro Phe Pro Phe Asp Pro 

1580 1585 1590 

His Tyr Leu Met Tyr Glu His Glu Arg Val Pro He Ala 

1595 1600 1605 

Glu Lys Glu Pro Ser Ser He He Ala Phe Ala Leu Ser 

1610 1615 1620 

Glu Tyr Arg Asn Ala Leu Glu Glu Leu Ser Lys Ala Thr 

1625 1630 1635 

Asn Ser Ala Glu Glu Gly Leu Pro Thr Asn Ser Thr Ser 

1640 1645 1650 

Arg Pro Lys Ser Ser Ser Pro He Arg Leu Pro Glu Met 

1655 1660 1665 

Gly Gin Thr Asn Arg Thr Thr Glu Thr Glu Pro Gin Pro 

1670 1675 1680 

Lys Ala Ser Gly Met Leu Ser Phe Phe Arg Gly Thr Ala 

1685 1690 1695 

Ser Pro Asp Leu Ser Ser Gin Lys Arg Glu Thr Leu Arg 

1700 1705 1710 

Asp Ser Ala Tyr Tyr Gin Val Qly Gin Thr Gly Lys Glu 

1715 1720 1725 

Qlu Asn Gin Gly Val Glu Pro Gin Asp Glu Val Asp Gly 

1730 1735 1740 

Thr Gin Lys Lys Gin Leu He Asn Pro His Val Glu Leu 



1745 



1750 



1755 
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Gin Phe Ser Asp Ala Asn Ala 

1760 

Ala Gly Glu Phe His Lys Met 

1775 

Glu Glu Asp Phe lie Arg Ser 

1790 

Ala Arg Gly Gly Lys Ser Gly 

1805 

Asp Arg Phe lie Leu Lys Gin 

1820 

Phe Leu Asp Phe Ala Pro His 

1835 

Val Gin Gin Lys Arg Pro Thr 

1850 

Tyr Arg lie Gly Tyr Lys Asn 

1865 

Leu Asp Leu Leu Val Met Glu 

1880 

Ala Gin Val Phe Asp Leu Lys 

1895 

Lys Thr Asp Thr Gly Lys Glu 

1910 

Glu Asn Leu Leu Lys Met Val 

1925 

Ser His Ser Lys Ala Val Leu 

1940 

His Phe Leu Ser Ser His Leu 

1955 

Gly Arg Asp Asp Thr Ser Asn 

1970 

Tyr He Arg Thr Phe Thr Trp 

1985 

Lys Ser Thr Gly He Leu Gly 

2000 

Val Ser Pro Glu Leu Tyr Arg 

2015 

Lys Tyr Phe Leu Met Val Pro 

2030 

Asn Cys 



Lys 


Phe Tyr Cys 


Arg Leu Tyr Tyr 




1765 






Arg 


Glu Val He 


Leu Asp 


oer oer 




1780 




L f oo 


Leu 


Ser His Ser 


Ser Pro 


1 rp uin 




1795 




XOUU 


Ala 


Ala Phe Tyr 


Ala Tnr 


viXU ASp 




1810 




1815 


Met 


Pro Arg Leu 


Glu Val 


Gin Ser 




1825 




1830 


Tyr 


Phe Asn Tyr 


He Thr 


Asn Ala 




1840 




1845 


Ala 


Leu Ala Lys 


He Leu Gly Val 




1855 




1860 


Ser 


Gin Asn Asn 


Thr Glu 


Lys Lys 




1870 




1875 


Asn 


Leu Phe Tyr Gly Arg Lys Met 




1885 




1890 


Gly 


Ser Leu Arg 


Asn Arg 


Asn Val 




1900 




1905 


Ser Cys Asp Val 


Val Leu 


Leu Asp 




1915 




1920 


Arg 


Asp Asn Pro 


Leu Tyr 


He Arg 




1930 




1935 


Arg 


Thr Ser He 


His Ser 


Asp Ser 




1945 




1950 


He 


He Asp Tyr 


Ser Leu 


Leu Val 




1960 




1965 


Glu 


Leu Val Val 


Gly He 


He Asp 




1975 




1980 


Asp 


Lys Lys Leu 


Glu Met 


Val Val 




1990 




1995 


Gly Gin Gly Lys Met Pro 


Thr Val 




2005 




2010 


Thr 


Arg Phe Cys 


Glu Ala 


Met Asp 




2020 




2025 


Asp His Trp Thr Gly Leu Gly Leu 




2035 




2040 



<210> 11 
<211> 551 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> ndsc_f eature 
<223> Incyte ID No: 

<400> 11 

Met Ser Gly Gly Glu 
1 5 
Asp Val Gly Thr Gly 

20 

Gly Val Leu Leu Ala 



5734965CD1 



Gin Lys Pro Glu Arg 

10 

Ser Val Arg Ala Ala 

25 

Phe Ala Asp Gin Pro 



Tyr Tyr Val Gly Val 

15 

Leu Val Asp Gin Ser 

30 

He Lys Asn Trp Glu 
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Pro Gin Phe Asn 
Ala Cys Cys Val 
Asn Gin lie Arg 
Val Leu Asp Lys 
His Arg 
Val Asn 



Asp Ser 
Ser Gin 



Tyr Val Gly Gly 



Leu Trp 
Gly His 
Gly Val 
Tyr Ser 
Gly Leu 
Gin Val 



Leu Lys 
Phe Phe 
Thr Ala 
Ala Qlu 
Glu Asp 
Leu Pro 



Glu Ala Ala Arg 
Ala Ser Leu lie 
Ala Asp Val Arg 
Thr Ser 



Gly lie 
Tyr Phe 
Gin Ser 
Ala Ala 
Ser lie 
Ala Gin 
Asp Phe 
Gly Met 
Ala He 
Arg Phe 
Thr Leu 



Arg Leu 
Ser Lys 
Ser Ala 
Val Thr 
Phe Pro 
Tyr Ala 
Pro Val 
His Gly 
Val Thr 
Leu Tyr 
He He 
Phe Leu 



35 
His 

50 
Val 

65 
Gly 

80 
Gin 

95 
Asn 
110 
Arg 
125 
Val 
140 
Glu 
155 
Asp 
170 
Arg 
185 
Lys 
200 
Phe 
215 
Pro 
230 
Asp 
245 
Asp 
260 
Gly 
275 
Ala 
290 
Asp 
305 
Met 
320 
Gly 
335 
Glu 
350 
Tyr 
365 
Gly 
380 
Asn 
395 
Gly 
410 
Leu 
425 
Glu 
440 
Cys 



His Glu Gin Ser 

Val 



Thr 



Lys Lys 
Gly Phe 
His Pro 



Leu 
Phe 
Val He Met 
He Asn Glu 



Asp 
Leu 
Trp 
Thr 



Met 
Asn 
Leu 
Ser 
Gly 
Val 
Gly 
Leu 
Ala 
His 
Val 
Pro 
Val 
Lys 
Leu 
Leu 
Phe 



Ser Val Glu 
Olu 
Phe 



Leu Arg 
Pro Asp 



Leu Cys 
Trp Asp 
Ala Asp 
Ala Ser 
Gly Leu 
His Ala 
Gly Leu 
He Cys 
He Phe 
Pro Gly 
Leu He 
Gin Val 
Asn Ser 
Leu Thr 



Arg Ser Pro 
Leu 
Ala 



Lys Leu 
Thr Val 



Ser 
Asp 
Asn 
Leu 
Leu 
Gly 
He 
Gly 
Val 
Phe 
Asp 
Lys 
His 
Val 
Leu 
Ser 
Gin 



Ala Met Glu Ala 



Gly Gly Leu Ser 



40 
Ser 

55 
Val 

70 
Ala 

85 
Pro 
100 
Leu 
115 
Lys 
130 
Met 
145 
He 
160 
Leu 
175 
Leu 
190 
Ser 
205 
Tyr 
220 
Gly 
235 
Pro 
250 
Gly 
265 
Cys 
280 
Thr 
295 
Pro 
310 
Trp 
325 
His 
340 
Ala 
355 
Leu 
370 
Asp 
385 
Ala 
400 
Gin 
415 
Ala 
430 
Ala 
445 
Lys 



Glu Asp He Trp 
Gin Gly He Asp 
Thr Cys Ser Leu 
Val Asn Gin Glu 



Asp His 
His Ser 



Arg Ala 
Val Leu 



Gin Ala Pro Lys 



Cys Trp 
Ser Trp 
Val Cys 
Phe Trp 
Ser Lys 
Asn Gly 
Gly He 
Leu Gly 
Glu Gly 
Ser Ser 
Gly Val 
Leu Asn 
Met Val 
Thr Ala 
Asp Leu 
Leu His 
Asp Leu 
Asp Leu 
He Ala 
Gly His 
Asn Pro 



Asp Lys 
Lys Ala 
Lys Trp 
Lys Met 
He Gly 
Leu Thr 
Ala Val 
Val He 
Gin Pro 
cys His 
Trp Gly 
Glu Gly 
Gin Gly 
Arg Cys 
He Lys 
Val Trp 
Thr Leu 
Asp Asp 
Leu Gly 
Ser He 
Leu Phe 



45 
Ala 

60 
Leu 

75 
Val 

90 
Gly 
105 
Val 
120 
Gin 
135 
Leu 
150 
Ala 
165 
Thr 
180 
Thr 
195 
He 
210 
Asn 
225 
Pro 
240 
Ala 
255 
Gly 
270 
Val 
285 
Met 
300 
Pro 
315 
Gly 
330 
His 
345 
Gin 
360 
Lys 
375 
Pro 
390 
Lys 
405 
Leu 
420 
Thr 
435 
Ser 
450 
Val 
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Gin Met His 
Glu Val Glu 
Ala Ser Gly 
Ser Lys Val 
Tyr Tyr Asp 
Gin Lys Glu 



455 
Ala Asp He 

470 
Ser Val Leu 

485 
Asp Phe Ala 

500 
Gly Lys Val 

515 
Lys Lys Tyr 

530 
Tyr Leu Ala 

545 



Thr Gly Met 
Val Gly Ala 
Ser Val Gin 
Val Phe Pro 
Gin Val Phe 
He Met Asn 



460 

Pro Val Val 
475 

Ala Val Leu 
490 

Glu Ala Met 

505 

Arg Leu Gin 
520 

Leu Lys Leu 
535 

Asp Asp 
550 



465 

Leu Ser Gin 
480 

Gly Ala Cys 
495 

Ala Lys Met 
510 

Asp Lys Lys 
525 

Val Glu His 
540 



<210> 12 

<211> 485 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7473788CD1 



<400> 12 
Met Arg Ser Gly 
1 

Pro Gly Ser Pro 
Ala Pro Ser Ala 
Arg Asp Ser Gly 
Trp Ser Lys Trp 
Thr Ala Arg Arg 
Asp His Phe Gin 
Lys Val Cys He 
Met Lys Tyr Met 
Arg Asn Val Phe 
Val Phe Leu Val 
Met Phe Met Val 
His Leu Gin Gin 
Tyr He Cys Glu 
His He He His 
Glu Arg Gly His 



Ala Glu 
5 

Pro Pro 

20 
Leu Pro 

35 
Asp Val 

50 
Lys Lys 

65 
Pro Val 

80 
He Leu 

95 
Val Gin 
110 

Asn Lys 
125 

Arg Glu 

140 

Asn Leu 
155 

Val Asp 
170 

Asn Val 
185 

Met Ala 
200 

Arg Asp 
215 

Ala His 
230 



Arg Arg 
Gly Arg 
Pro Pro 
Arg Ser 
Arg Met 
Phe Asp 
Arg Ala 
Lys Arg 
Gin Gin 
Leu Glu 
Trp Tyr 
Leu Leu 
Gin Phe 
Leu Ala 
Val Lys 
Leu Thr 



Gly 

Ala 

Ala 

Gin 

Gly 

Asp 

He 

Asp 

Cys 

He 

Ser 

Leu 

Ser 

Leu 

Pro 

Asp 



Ser 

10 
Arg 

25 
Ala 

40 
Pro 

55 
Ser 

70 
Lys 

85 
Gly 
100 
Thr 
115 
He 
130 
Leu 
145 
Phe 
160 
Gly 
175 
Glu 
190 
Asp 
205 
Asp 
220 
Phe 
235 



Ser 
Pro 
Gly 
Arg 
Ser 
Glu 
Lys 
Glu 
Glu 
Gin 
Gin 
Gly 
Asp 
Tyr 
Asn 
Asn 



Ala Ala Ala 



Ala Gly Ser 
Gin Pro Arg 
Pro Leu Phe 
Met Ser Ala 
Asp Val Asn 
Gly Ser Phe 
Lys Met Tyr 
Arg Asp Glu 
Glu He Glu 
Asp Glu Glu 
Asp Leu Arg 
Thr Val Arg 
Leu Arg Gly 
He Leu Leu 
He Ala Thr 



Ser 

15 
Asp 

30 
Ala 

45 
Gin 

60 
Ala 

75 
Phe 

90 
Gly 
105 
Ala 
120 
Val 
135 
His 
150 
Asp 
165 
Tyr 
180 
Leu 
195 
Gin 
210 
Asp 
225 
He 
240 
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lie Lys Asp 
Tyr Met Ala 
Gly Tyr Ser 
Tyr Glu Leu 
Asn Ala Val 
Gin Tyr Val 
Lys Leu Leu 
Asp Val Qln 
Leu Ser Glu 
Arg Leu His 
Glu Ser Arg 
Lys Ser Arg 
Tyr Leu Oln 
Phe Asn Arg 
Pro Leu Pro 
Thr Arg Arg 
Ser Ala Gly 



Gly Glu Arg 

245 
Pro Glu He 

260 
Phe Glu Val 

275 
Leu Arg Gly 

290 
Glu Ser Leu 

305 
Pro Thr Trp 

320 
Thr Val Asn 

335 
Ala Ala Pro 

350 
Lys Arg Val 

365 
Cys Asp Pro 

380 
Pro Leu His 

395 
Asp Asn Ser 

410 
Asp Cys Leu 

425 
Glu Lys Leu 

440 
Ala Leu Ser 

455 
Thr Leu Arg 

470 
Ser Gly 

485 



Ala Thr Ala 
Phe His Ser 
Asp Trp Trp 
Trp Arg Pro 
Val Gin Leu 
Ser Lys Glu 
Pro Glu His 
Ala Leu Ala 
Glu Pro Gly 
Thr Phe Glu 
Lys Lys Lys 
Arg Asp Ser 
Asp Ala He 
Lys Arg Ser 
Pro Gly Met 
Leu Pro Met 



Leu Ala Gly 

250 

Phe Val Asn 
265 

Ser Val Gly 
280 

Tyr Asp He 
295 

Phe Ser Thr 
310 

Met Val Ala 
325 

Arg Leu Ser 
340 

Gly Val Leu 
355 

Phe Val Pro 
370 

Leu Glu Glu 
385 

Lys Arg Leu 
400 

Ser Gin Ser 
415 

Gin Oln Asp 
430 

Gin Asp Leu 
445 

Leu Arg Ser 
460 

Cys Gly Pro 

475 



Thr Lys Pro 

255 

Gly Gly Thr 
270 

Val Met Ala 
285 

His Ser Ser 
300 

Val Ser Val 
315 

Leu Leu Arg 
330 

Ser Leu Gin 

345 

Trp Asp His 
360 

Asn Lys Gly 
375 

Met He Leu 
390 

Ala Lys Asn 
405 

Glu Asn Asp 
420 

Phe Val He 
435 

Pro Arg Glu 
450 

Leu Trp Arg 
465 

He Cys Pro 
480 



<210> 13 
<211> 282 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 

<400> 13 

Met Pro Ala Phe He 
1 5 
Leu His Thr Val Phe 

20 

Glu Oln Asp Phe Leu 

35 

Asp Glu Leu Asp Cys 

50 

Val Gly Phe Asn Gly 

65 

Ala Asp Ser Glu Lys 



3107989CD1 



Oln Met Gly Arg Asp 

10 

Cys Ala Thr Gly Gly 

25 

Thr He Gly Asp Leu 

40 

Leu He Lys Gly He 

55 

Arg Ser Oln Cys Tyr 

70 

Cys Oln Lys Leu Pro 



Lys Asn Phe Ser Ser 

15 

Gly Ala Tyr Lys Phe 

30 

Gin Leu Cys Lys Leu 

45 

Leu Tyr He Asp Ser 

60 

Tyr Phe Glu Asn Pro 

75 

Phe Asp Leu Lys Asn 
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80 



85 



90 



Pro Tyr Pro Leu Leu Leu Val Asn lie Gly Ser Gly Val Ser lie 

95 100 105 

Leu Ala Val Tyr Ser Lys Asp Asn Tyr Lys Arg Val Thr Gly Thr 

110 115 120 

Ser Leu Gly Qly Gly Thr Phe Phe Gly Leu Cys Cys Leu Leu Thr 

125 130 135 

Gly Cys Thr Thr Phe Glu Glu Ala Leu Glu Met Ala Ser Arg Gly 

140 145 150 

Asp Ser Thr Lys Val Asp Lys Leu Val Arg Asp He Tyr Gly Gly 

155 160 165 

Asp Tyr Glu Arg Phe Gly Leu Pro Gly Trp Ala Val Ala Ser Ser 

170 175 180 

Phe Gly Asn Met Met Ser Lys Glu Lys Arg Asp Ser He Ser Lys 

185 190 195 

Glu Asp Leu Ala Arg Ala Thr Leu Val Thr He Thr Asn Asn He 

200 205 210 

Gly Ser He Ala Arg Met Cys Ala Leu Asn Glu Asn He Asp Arg 

215 220 225 

Val Val Phe Val Gly Asn Phe Leu Arg He Asn Met Val Ser Met 

230 235 240 

Lys Leu Leu Ala Tyr Ala Met Asp Phe Trp Ser Lys Gly Gin Leu 

245 250 255 

Lys Ala Leu Phe Leu Glu His Glu Qly Tyr Phe Gly Ala Val Gly 

260 265 270 

Ala Leu Leu Glu Leu Phe Lys Met Thr Asp Asp Lys 



<210> 14 
<211> 151 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7482887CD1 

<400> 14 

Met Ala Asn Thr Glu Ser He He He Asn Pro Ser Ala Val Gin 
15 10 15 

His Ser Leu Val Gly Glu He He Lys Tyr Ser Glu Gin Lys Gly 

20 25 30 

Phe Tyr Leu Val Thr Met Lys Phe Leu Arg Ala Ser Glu Lys Pro 

35 40 45 

Leu Lys Pro His Tyr Thr Asn Leu Lys Asp His Pro Phe Phe Pro 

50 55 60 

Asp Leu Val Lys Tyr Met Asn Ser Gly Gin Val Val Ala Met Val 

65 70 75 

Leu Glu Gly Leu Asn Val Ala Lys Thr Gly Leu Arg Met Leu Qly 

80 85 90 

Glu Thr Asn Ser Leu Gly Ser Met Leu Glu Thr He He Arg Arg 

95 100 105 

Asp Phe Cys Ala Lys He Gly Gly Asn Val He Gly Qly Ser Asp 

110 115 120 

Ser Leu Qln Ser Ala Glu Lys Glu He Ser Leu Trp Phe Lys Pro 



275 



280 



125 



130 



135 
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Lys Glu Pro Val Asp Tyr Arg Ser Cys Ala Tyr Asp Trp Val Tyr 

140 145 150 

Ala 



<210> 15 

<211> 410 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2963414CD1 



<400> 15 
Met Val Val 
1 

Val Qln Leu 
Val Met Lys 
Arg Olu Gin 
Phe Thr Pro 
Asp Ser Thr 
Ser Gin Olu 
lie Trp Gin 
Cys Thr Leu 
Lys Glu Ser 
Asn Thr Pro 
Val Glu Arg 
Ala His Leu 
His Arg Phe 
Pro Cys Val 
Asp Ala Ser 
Gin Ser Thr 
Val Tyr Gin 
Tyr Gly Arg 
Gin Phe Leu 
Pro lie Leu 



Gin Asn Ser 
5 

Glu Pro Phe 
20 

Tyr Asp Glu 
35 

Arg Phe Tyr 

50 

Gin Tyr Lys 
65 

Gly His Leu 
80 

Pro Phe Lys 
95 

Thr Leu Gin 

110 
Ala Gin Trp 

125 
Pro Ala Lys 

140 

Ala Phe Ser 

155 
Lys Ser Phe 

170 
Thr Arg Leu 

185 
Leu Leu Leu 

200 
Leu Asp Leu 

215 
Glu Glu Lys 

230 
Ser Ala Cys 

245 
Thr Asp Lys 

260 
Lys Leu Ser 

275 
His Asn Gly 

290 
His Gin Leu 



Ala Asp Ala 
Leu His Gin 
His Thr Val 
Glu Ser Leu 
Gly Thr Val 
Ser Leu Val 
Val Ser Thr 
Gin Thr Thr 
Pro His Ala 
Ala Leu Leu 
Leu Val Glu 
Asn Pro Trp 
Cys Ser Glu 
Glu Asn Val 
Lys Met Gly 
Lys Ala Arg 
Leu Gly Val 
Lys Tyr Phe 
Val Glu Gly 
Ser His Leu 
Arg Ala Leu 



Gly Asp Met 

10 

Val Gly Gly 
25 

Cys Lys Pro 
40 

Pro Leu Ala 

55 

Thr Val His 
70 

Ala Asn Pro 
85 

Glu Ser Ala 

100 

Gly Ser Asn 
115 ' 

Gin Leu Ala 
130 

Arg Ser Glu 

145 

Asp Thr Asn 
160 

Gly Leu Gin 
175 

Tyr Pro Glu 

190 

Val Ser Gin 
205 

Thr Arg Gin 
220 

His Met Arg 

235 

Arg lie Cys 
250 

Leu Cys Lys 
265 

Phe Arg Gin 
280 

Arg Arg Glu 
295 

Leu Ser lie 



Arg Ala Gly 
15 

His Met Ser 
30 

Leu Val Ser 
45 

Met Lys Arg 
60 

Leu Trp Lys 
75 

Val Lys Glu 
90 

Ala Val Ala 

105 

Gly Ser Asp 
120 

Arg Ser Pro 
135 

Pro His Leu 

150 

Gly Asn Oln 
165 

Cys His Gin 
180 

Asn Lys Arg 

195 

Tyr Thr His 
210 

His Gly Asp 
225 

Lys Cys Ala 

240 

Gly Met Oln 
255 

Asp Lys Tyr 
270 

Ala Leu Tyr 
285 

Leu Leu Glu 
300 

lie Arg Ser 
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305 

Gin Ser Ser Tyr Arg 

320 

Asp Gly Gin Glu Pro 

335 

His Glu Ala Pro Gin 

350 

Thr Lys Val Asp lie 

365 

Lys Gly Tyr Trp Asn 

380 

Gly Tyr lie Phe Gly 

395 

lie Gin Glu Gly Glu 

410 



310 

Phe Tyr Ser Ser Ser 

325 

Pro Glu Arg Ala Pro 

340 

Ala Ala His Gly Ser 

355 

Arg Met lie Asp Phe 

370 

Glu His Thr Thr Tyr 

385 

Leu Glu Asn Leu lie 

400 



315 

Leu Leu Val lie Tyr 

330 

Gly Ser Pro His Pro 

345 

Ser Pro Gly Gly Leu 

360 

Ala His Thr Thr Tyr 

375 

Asp Gly Pro Asp Pro 

390 

Arg lie Leu Gin Asp 

405 



<210> 16 

<211> 1581 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7477139CD1 



<400> 16 
Met Ala Gly 

1 

Gly His Leu 
lie Gly Leu 
Lys Thr Gly 
Thr Pro Leu 
Gin Lys Ser 
Arg Thr Glu 
He Val Ser 
Gin Arg His 
Ser Val Thr 
Glu Asp Trp 
Ala His Leu 
Gin Asn Val 
Phe Gly Val 
Ser Phe lie 



Pro Gly Gly 
5 

Pro Asp Pro 
20 

Gly Thr Tyr 
35 

Ala Phe Thr 
50 

Pro Glu He 
65 

Val Gly Trp 
80 

Leu Asn Leu 
95 

Phe Tyr Gly 

110 
Gin Leu Trp 

125 
Asp Val Val 

140 
He Ala Tyr 

155 
His Ala His 

170 
Leu Leu Thr 

185 
Ser Ala Gin 

200 
Gly Thr Pro 

215 



Trp Arg Asp 
Thr Gly He 
Gly Arg He 
Ala Val Lys 
Gly Arg Arg 
Arg Tyr Ser 
Leu Arg Lys 
Ala Phe Phe 
Met Val Met 
Arg Met Thr 
He Cys Arg 
Arg Val He 
His Asn Ala 
Val Ser Arg 
Tyr Trp Met 



Arg Glu Val 
10 

Phe Ser Leu 
25 

Tyr Leu Gly 
40 

Val Met Asn 
55 

Val Arg Val 
70 

Asp Glu Glu 
85 

Tyr Ser Phe 
100 

Lys Leu Ser 

115 

Glu Leu Cys 
130 

Ser Asn Gin 
145 

Glu lie Leu 
160 

His Arg Asp 
175 

Glu Val Lys 
190 

Thr Asn Gly 

205 

Ala Pro Glu 
220 



Thr Asp Leu 
15 

Asp Lys Thr 
30 

Leu His Glu 
45 

Ala Arg Lys 
60 

Asn Lys Tyr 
75 

Glu Asp Leu 
90 

His Lys Asn 
105 

Pro Pro Gly 

120 

Ala Ala Gly 
135 

Ser Leu Lys 
150 

Gin Gly Leu 
165 

He Lys Gly 
180 

Leu Val Asp 
195 

Arg Arg Asn 
210 

Val He Asp 
225 
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Cys Asp Olu 
Trp Ser Val 
Pro Leu Cys 
Arg 6lu Ser 
Phe His Asn 
Arg Pro Thr 
lie Lys Asn 
Thr Gly lie 
Phe Glu Arg 
Phe Arg Gly 
Ser Ser Arg 
Gin Pro Arg 
Leu Gin Gin 
Ala Leu Asp 
Pro Gin Arg 
Ala Gin Val 
Lys Ala Pro 
Leu Gin Ala 
Gin Val Ser 
Gin Asp Leu 
Val Pro Glu 
Arg His Asn 
Pro Glu Gin 
Ala Glu Thr 
Gin Val Phe 
Pro Leu His 
Thr Glu Gly 
Gin Ala lie 



Asp Pro Arg 

230 
Gly He Thr 

245 
Asn Leu Gin 

260 
Ala Pro Thr 

275 
Phe Met Glu 

290 
Ser Ala Asn 

305 
Olu Arg His 

320 
He Lys Lys 

335 
Glu Glu Ala 

350 
Pro Ser Cys 

365 
Cys Arg Pro 

380 
Trp Leu Pro 

395 
Leu Gin Gly 

410 
Ser Ala Pro 

425 
Leu Gin Gly 

440 
Lys Ala Lys 

455 
Pro Arg Leu 

470 
Gin Val Arg 

485 
Lys Lys Gin 

500 
Asp Gin Val 

515 
Gin Gin Arg 

530 
Gin Val Pro 

545 
Pro Glu Val 

560 
Glu Ala Glu 

575 
Leu Pro Leu 

590 
Leu Asp Thr 

605 
Ser Pro Gin 

620 
Gly -Ser Val 

635 



Arg Ser Tyr 
Ala He Glu 
Pro Leu Glu 
Val Lys Ser 
Lys Cys Thr 
Met Leu Gin 
Val Val Glu 
Arg Oln Lys 
He Lys Glu 
Thr His Glu 
Leu Arg Val 
Asp Arg Glu 
Ala Ala Arg 
Lys Pro Leu 
Ala Ala Arg 
Ala Ser Lys 
Arg Arg Ala 
Ala Pro Arg 
Oln Ala Gin 
Pro Glu Glu 
Gin Gly Gin 
Glu Oln Glu 
Gin Glu Gin 
Glu Pro Glu 
Leu Ser Gin 
Gin Val Leu 
Ala Oln Ala 
Gin Ala Leu 



Asp Tyr Arg 
235 

Met Ala Glu 
250 

Ala Leu Phe 
265 

Ser Gly Trp 
280 

He Lys Asn 
295 

His Pro Phe 
310 

Ser Leu Thr 
325 

Lys Gly He 
340 

Oln Tyr Thr 
355 

Leu Leu Arg 
370 

Leu His Gly 
385 

Glu Pro Gin 
400 

Val Phe Met 
415 

Lys Gly Gin 
430 

Val Phe Met 
445 

Pro Leu Gin 

460 

Ala Arg Val 
475 

Leu Leu Gin 
490 

Thr Gin Thr 
505 

Phe Gin Gly 
520 

Ala Pro Glu 
535 

Leu Glu Oln 
550 

Ala Ala Glu 
565 

Ser Leu Arg 
580 

Asp His His 
595 

He Pro Val 
610 

Trp Thr Leu 
625 

He Glu Gly 
640 



Ser Asp Val 
240 

Gly Ala Pro 
255 

Val He Leu 
270 

Ser Arg Lys 
285 

Phe Leu Phe 
300 

Val Arg Asp 
315 

Arg His Leu 
330 

Pro Leu He 

345 

Val Arg Arg 
360 

Leu Pro Thr 
375 

Glu Pro Ser 
390 

Val Gin Ala 
405 

Pro Leu Gin 

420 

Ala Oln Ala 
435 

Pro Leu Gin 
450 

Met Oln He 

465 

Leu Met Pro 
480 

Val Oln Ser 
495 

Ser Glu Pro 
510 

Gin Asp Gin 
525 

Gin Gin Gin 
540 

Asn Gin Ala 

555 

Pro Ala Gin 
570 

Val Asn Ala 
585 

Val Leu Leu 

600 

Glu Gly Oln 
615 

Glu Pro Pro 
630 

Leu Ser Arg 
645 
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Lieu 




AXci 


&an Cai^ 
irro nSn OSL^ 


noiA 


A an 


Ss»Y* TiVa Pi^o Leii Glv 
uys tr*,\j ^^Ajf 














655 


660 

W V w 


irL.\j Lieu 




it 


T . A1 1 


new wxu noli 




Sep 


Seir Asn Ara Phe Tvr 








u V 3 






670 


675 


Cat" n n 


irro 


ft! 11 


VjXil 


riXCl \3XU 


T.VS 


LiVS 


Ser Lvs Val Ser Thr 








O DU 






V o ^ 


690 


T 1 A ^^^^ 
Li€U /\J:y 




nXcl 


T ai 1 


aXo Lijfo 






Pto Lv s a ira Phe A r a 
















705 


AXSi uys 


ser 


oer 


Trp 


Arg irro oxu 


Tats 

Liys 


T .01 1 
LiBU 


11 T.A1I Q^T" Ann Ti@ii 








"Tin 
/ xu 






71 R 

/ X 3 


720 

WAV 


\aXM Ala 


Arg 


Arg 


oxn 


Arg Ac^g vjxn 




A y n 
nX y 


Tt-o f3l 11 AQn Tl© Phe 








"7 0 C 








735 


TV n v% 1 ^« 


nxs 


uXu 


UXU 


A 1 1 1 T All ^i^n 

uxu Lieu nx^g 


V7XX1 


VAX 


A^n T.VQ Thf Ser Tm 
uyo Aiix 19^4* x^jgf 








/4U 






/ ft J 


7^n 


Arg om 


Trp 


Qxy 


irro 


oer Asp V7xn 


ijeu 


Tl A 

xxe 


Asp abxi oex Jrilc? xilX 








/ 33 






7fin 
/ ou 




(3iy nee 


Gin 


Asp 


Leu 


Liys Liys iyr 


ijeii 


T a 

Xjys 


nlv/ T.v^a TTVir* TVit^ PVi** 
wXy Ltjfa XllL ±liX f^ltC 








770 






77R 


780 


HIS Asn 


Val 


Gin 


Val 


vax xxe ly^x^ 


Arg 


a1 j» 

AXa 


X7!a 1 Tjvra Ol v Aqvi AQn 
VAX Ljjfa wxy nasi 








785 






"7 on 


7QS 


Asp vai 


Ala Thr Arg 


£>er mr vax 


Pro 


Gin 


A^/v Qoi^ T.011 T.mi Dl 11 
idCSX IjOU XjCSU wXU 








800 






805 


0 X w 


Gin Ala 


Gin 


Lys 


Pro 


He Asp He 


Arg 


Gin 


ArTf <>»T" Cot* oln A^n 
AX y OCX ^cx wxii noli 








815 






820 




Arg Oln 


Asn 


Trp 


Leu 


Ala Ala Ser 


Gly Asp 


•9c?x Liys* nxo uys* xxe 








830 






835 




Leu Ala Gly Lys Thr Gin Ser Tyr 


Cys 


Leu 


xnx xxe i>yx xxq ocx 








845 






850 




Glu Val 


Lys 


Lys 


Glu 


Glu Phe Gin 


Glu Gly 


Mat* Aon ol Tl T.v/o 
neu. ASfi wxii Xiys v^yo 








860 






865 


870 


. Oln Gly Ala Gin Val Gly Leu Gly 


Pro 


Glu 


ol V Hi Q CH/Q Tl e Tm 
V7xy nx0 wys9 xxe xx|^ 








875 






880 




Gin Leu Gly Glu Ser Ser Ser Glu 


Glu 


Glu 


Cat" PT-rk \75al T'Trr- Ol\/ 
06X Jrxu vax xiix wxy 








890 






895 


7 UU 


Arg Arg 


Ser 


Gin 


Ser 


Ser Pro Pro 


Tyr 


Ser 


'P'Vtv Tl a Aon Ol n T.Vfa 

inr xxe Asp oxn xiys 








905 






910 


Ql R 

7X3 


Leu Leu 


Val 


Asp 


He 


His Val Pro 


Asp Gly 


irne Xiys vax uxy xiys 








920 






925 


7 u 


lie Ser 


Pro 


Pro 


Val 


Tyr Leu Thr 


Asn 


Glu 


X X p vax uxy ^yx ash 








935 






940 




Ala Leu 


Ser 


Glu 


He 


Phe Arg Asn 


Asp Trp 


T at 1 T'Vi 1^ Pi^/^ Ala Pi^n 
woU XHx f^X^O AXa IrxiiJ 








950 






955 


7 0 u 


Val He 


Gin 


Pro 


Pro 


Glu Glu Asp 


Gly Asp 


THr*" \7a 1 ol 11 T.aii TH/t* 
lyx VclX VJXU iJCU ^yx 








965 






970 




Asp Ala 


Ser Ala Asp Thr Asp Gly 


Asp 


Asp 


&an Aon ol 11 C^T" A^n 

A&t^ wXU OCX natJ. 








980 






985 


990 

7 7 V 


Asp Thr Phe Glu Asp Thr Tyr Asp 


His 


Ala 


Aan Ol V Asn Asn Aan 

AOll wxy ADll A9j^ AO^ 








995 




1000 


X u V -J 


Leu Asp Asn Gin Val Asp Gin Ala 


Asn 


Asp 


\75i 1 T.\7Q AQn Hi ^ 
vox wy a uy B a5>^ nxo 








1010 




1015 


1 090 


Asp Asp 


Asp 


Asn 


Asn 


Lys Phe Val 


Asp 


Asp 


\7sil A en Aan Aisn TV/v* 
vdx ASH Asn Asri xyx 








1025 






1030 


1035 


Tyr Glu Ala Pro 


Ser 


Cys Pro Ser 


Leu 


Leu 


Ser Gly Gin Ala Met 








1040 






1045 


1050 


Ala Glu 


Met 


Glu 


Ala 


Ala Ser Lys 


Met 


Val 


Met Met Glu Val Val 








1055 






1060 


. 1065 
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Olu Lvs 


Arcf 


Lvs Pro 


Thr 


Qlu 


Ala 


Met Glu 


Ala 

mm 


He 


Oln Pro He 






1070 








1075 






1080 


Glu Ala 


Met 

• 


Glu Glu 


Val 


Gin 


Pro 


Val Arg 


Asp 


Asn 


Ala Ala He 






1085 








1090 






1095 


Glv AsD 


Gin 


Glu Glu 


His 


Ala 

mm ^m 


Ala 


Asn He 

m mmrm m ^k ifc ^m 


Glv 


Ser Glu Arg Arg 






1100 








1105 






1110 


Qlv Ser 


Glu 


Glv AsD 


Qlv 


Qlv 


Lvs 


Qlv Val 


Val 


Ara 


Thr Ser Qlu 






1115 

A ^ 








1120 






1125 

^k ^k m0 ^m 


Glu Ser 


Glv 


Ala Leu 


Qlv 


Leu 


Asn 


Qlv Qlu 


Qlu 


Asn 


Cvs Ser Qlu 






1130 








1135 






1140 

^k m 


Thr AsD 


Glv 


Pro Glv 


Leu 

m^ ^m Wft 


Lvs 

mm J 


Arcr 


Pro Ala 


Ser 


Gin 

AAA 


AsD Phe Glu 






1145 








1150 

X X ^ w 






1155 

A A a# 


Tvr Leu 


Gin 


Glu Qlu 


Pro 


Qlv 


Qlv 


Glv A an 


Qlu 

wX U 


Ala 


Ser Asn Ala 

ii^CX ^IBK f%XCt 






1160 








1165 

X X O 






1 170 

X X / w 


Xle AsD 


Ser 


Glv Ala 


Ala 


Pro 


Ser 


Ala Pro 


Asn 


His 

XAXB 


Glu Ser Asn 

wXU OBX AB^ 






1175 








1180 

X X O V 






1185 


Asn Lvs 


Ast) 


Tie Seir 


Qlu 


Ser 


Ser 


THr Gl n 

X IIX VJXlt 


Ser 


Asp 


Phe Ser Ala 






1190 








1195 

X X 7 






1200 


Asn 


Seir 


Seir Piro 

A \y 


SeiT 


Lva 

XJ B 


Glv 
varxy 


Ser Gl V 
0OX wxy 


Met- 


Ser 


Ala Asp Ala 






1205 








1 51 0 

X ^ X w 






1215 


Asn Phe 


Ala 


Ser Ala 


He 

A A 


Leu 


Tvr 
xyx 


Ala Glv 


Phe 


Val 


Qlu Val Pro 






1220 

JL 4M M W 








1225 

X A A J 






1230 


Glu Qlu 


Ser 


Pro Lvs 


Gin 

^m AAA 


Pro 


Ser 


Glu Val 


Asn 


Val 


Asn Pro Leu 






1235 








1240 






1245 


Tyr Val 


Ser 


Pro Ala 


Cvs 


Lvs 

mm J iv 


Lvs 


Pro Leu 


He 


His Met Tyr Qlu 






1250 








1255 






1260 


Lvs Glu 


Phe 


Thr Ser 

4 ^^^^ 


Glu 


He 

A A ^9 


Cvs 


Cvs Glv 

^^jf ^ \^i^y 


Ser 

mQX 


Leu Trp Gly Val 






1265 








1270 

Xa / V 






1275 


Asn Leu 


Leti 


Leu Glv 


Thr 

J> A4X 


Am 
r»x y 


Ser 


Asn T.eii 




Leu 


Met Asp Arg 






1280 








1285 

X A O W 






1290 


Ser Qlv 


Lvs 


Ala AsD 


He 

A A ^9 


Thr 


Lva 

xjy a 


Leu Tie 

XJ9U XXV 


Arpf 

AX y 


Arg 


Arg Pro Phe 






1295 








1300 






1305 


Ara Gin 


lie 


Gin Val 


Leu 


Glu 

vx u 


Pro 

X X w 


T.eii Asn 


T.011 


Leu 


He Thr He 






1310 








131*^ 

X J X 9 






1320 


Sez: Glv 


His 

AAA ^9 


Lvs Asn 


AT*Cf 
nA 


T<eu 

vx 


Arrt 
nx \j 


Val Tvr 

vct^ xyx 


His 
nx B 


Leu Thr Trp Leu 






1325 








1 330 

X J W 






1335 


Aira Asn 


Lvs 




A^n 


Asn 




Pm Gl 11 

xXV K7XU 


OCX 


Lys 


Arg Arg Qln 






1340 

X "S W 








1 3A5 






• 1350 


Glu Glu 


Met 


Leu Lvcs 


Thr 

X XIX 


Glu 

V7X U 


Glu 


Al A f^vs 
Axcx ^y o 


Xjy 9 


Ala 


He Asp Lys 






1355 








1 3fi0 






1365 


Leu THr 


Glv 


Cvs Olti 

N«»Jf w \7X U 


His 

nx B 


Phe 


iJPX 


Val Leu 


Gl n 


His 


Glu Glu Thr 






1370 








1 375 






1380 


Thr Tvr 


lie 


Ala lie 


Ala 


Leu 


Lvs 

mmjf ^m 


Ser Ser 


He 

X X B 


His 


Leu Tyr Ala 






1385 








1390 






1395 


Trp Ala 


Pro 


Lvs Ser 

mm J ^m ^m m» 


Phe 


ASD 


Qlu 

^m ^ %4 


Ser Thr 

^^^^^ J» 


Ala 


He 


Lys Val Phe 






1400 

A V W V 








1405 

X V w W 






1410 


Pro Thr 


Leu 

MM ^Iff %4 


AsD His 


Lvs 


Pro 

Ai ^ v# 


Val 


Thr Val 

X &x^ vox 


Asn 
nB^ 


Leu Ala He Oly 






1415 

A 7 *s 








1420 






1425 


Ser Glu 


Lvs 


Ara Leu 


Lvs 

mm^f mm 


He 

^k ^9 


Phe 


Phe Ser 


Ser 


Ala Asp Gly Tyr 






1430 








1435 

XV (J «^ 






1440 


Hi.s Leu 


lie 


AsD Ala 

s&0J|a^ <^An 


Qlu 


Ser 
ox 


Qlu 

SJX u 


Val Me^ 

vox Vl C7 l« 


Qer 

OCX 


Asp 


Val Thr Leu 














J.43U 






1455 


Pro Lys 


Asn 


Asn lie 


He 


He 


Leu 


Pro Asp 


Cys 


Leu Gly He Gly 






1460 








1465 






1470 


Met Met 


Leu 


Thr Phe 


Asn 


Ala 


Qlu 


Ala Leu 


Ser 


Val 


Qlu Ala Asn 






1475 








1480 






1485 
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Glu Gin Leu Phe Lys Lys lie Leu Glu Met Trp Lys Asp lie Pro 

1490 1495 1500 

Ser Ser He Ala Phe Glu Cys Thr Gin Arg Thr Thr Gly Trp Gly 

1505 1510 1515 

Gin Lys Ala He Glu Val Arg Ser Leu Gin Ser Arg Val Leu Glu 

1520 1525 1530 

Ser Glu Leu Lys Arg Arg Ser He Lys Lys Leu Arg Phe Leu Cys 

1535 1540 1545 

Thr Arg Gly Asp Lys Leu Phe Phe Thr Ser Thr Leu Arg Asn His 

1550 1555 1560 

His Ser Arg Val Tyr Phe Met Thr Leu Gly Lys Leu Glu Glu Leu 

1565 1570 1575 

Gin Ser Asn Tyr Asp Val 

1580 

<210> 17 
<211> 1084 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 55009053CD1 

<400> 17 



Met 


Glu 


Thr 


Gin 


Ala Val Ala Thr Ser 


Pro Asp Gly Arg Tyr 


Leu 


1 








5 


10 


15 


Lys 


Phe 


Asp 


He 


Glu He Gly Arg Gly 


Ser Phe Lys Thr Val 


Tyr 










20 


25 


30 


Arg Gly Leu 


Asp 


Thr Asp Thr Thr Val 


Glu Val Ala Trp Cys 


Glu 










35 


40 


45 


Leu 


Gin 


Thr 


Arg 


Lys Leu Ser Arg Ala 


Glu Arg Gin Arg Phe 


Ser 










50 


55 


60 


Glu 


Glu 


Val 


Glu 


Met Leu Lys Gly Leu 


Gin His Pro Asn He 


Val 










65 


70 


75 


Arg 


Phe Tyr 


Asp 


Ser Trp Lys Ser Val 


Leu Arg Gly Gin Val 


Cys 










80 


85 


90 


He 


Val 


Leu 


Val 


Thr Glu Leu Met Thr 


Ser Gly Thr Leu Lys 


Thr 










95 


100 


105 


Tyr 


Leu 


Arg 


Arg 


Phe Arg Glu Met Lys 


Pro Arg Val Leu Gin 


Arg 










110 


115 


120 


Trp 


Ser 


Arg 


Gin 


He Leu Arg Gly Leu 


His Phe Leu His Ser 


Arg 










125 


130 


135 


Val 


Pro 


Pro 


He 


Leu His Arg Asp Leu 


Lys Cys Asp Asn Val 


Phe 










140 


145 


150 


He Thr Gly 


Pro 


Ser Gly Ser Val Lys 


He Gly Asp Leu Gly 


Leu 










155 


160 


165 


Ala 


Thr 


Leu 


Lys 


Arg Ala Ser Phe Ala 


Lys Ser Val He Gly 


Thr 










170 


175 


180 


Pro 


Glu 


Phe 


Met 


Ala Pro Glu Met Tyr 


Glu Glu Lys Tyr Asp 


Glu 










185 


190 


195 


Ala 


Val 


Asp 


Val 


Tyr Ala Phe Gly Met 


Cys Met Leu Glu Met 


Ala 










200 


205 


210 


Thr 


Ser 


Glu 


Tyr 


Pro Tyr Ser Glu Cys 


Gin Asn Ala Ala Gin 


He 










215 


220 


225 


Tyr Arg 


Lys 


Val 


Thr Ser Gly Arg Lys 


Pro Asn Ser Phe His 


Lys 
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Val Lys He 
Thr Asp Lys 
Ala Phe Phe 
Glu Asp Asp 
Glu Asp Ala 
Olu Phe Leu 
Gin Glu Met: 
Pro Val Ala 
Lys Arg Glu 
Pro Glu Pro 
Pro Pro Ser 
Gin His Gin 
Thr Ser Asp 
Leu Asp Ala 
Ser Ser Leu 
Leu Ser lie 
Asp Ser Tyr 
Glu Gly Met 
Arg Arg Pro 
Asn Asp Arg 
Met Val Thr 
He Ala Ala 
Arg Asp Gly 
Glu Thr Leu 
Asp Thr Leu 
Pro Val Pro 
Thr Ser Leu 
Ser Ser Pro 



230 
Pro Glu Val 

245 
Asn Glu Arg 

260 
Arg Glu Glu 

275 
Gly Glu Lys 

290 
Arg Arg Gly 

305 
Phe Gin Leu 

320 
Val Ala Leu 

335 
Arg Ala Val 

350 
Lys Leu Arg 

365 
Gly Pro Pro 

380 
Val Phe Pro 

395 

Pro Phe Leu 

410 
Cys Glu Thr 

425 
Ser Asp Pro 

440 
Ala Glu Ser 

455 
Pro Arg Ser 

470 
Ala Ser Asp 

485 
Gly Gin Met 

500 
Arg Ser Arg 

515 
Val Val Glu 

530 
Phe Arg Phe 

545 
Ala Met Val 

560 
Phe Leu Arg 

575 
Leu Lys Arg 

590 
Ser Pro Gin 

605 
Leu Pro Asp 

620 
Glu His Arg 

635 
Gly Thr Pro 



Lys Glu He 
Phe Thr He 
Arg Gly Val 
Pro Gly Leu 
Gly Arg Pro 
Gly Arg Asp 
Gly Leu Val 
Arg Glu Arg 
Lys Ala Arg 
Pro Ala Thr 
Pro Glu Pro 
Phe Arg His 
Asp Gly Tyr 
Ala Leu Gin 
His Leu Cys 
Gly Pro Gly 
Ala Ala Ser 
Arg Arg Pro 
Leu Arg Val 
Cys Gin Leu 
Asp Leu Asp 
Tyr Asn Glu 
Arg He Arg 
Asp Thr Gly 
Glu Olu Pro 
Pro Ser Asn 
Ser Trp Thr 
Leu Ser Pro 



235 

He Glu Gly 
250 

Gin Asp Leu 

265 

His Val Glu 
280 

Lys Leu Trp 
295 

Arg Asp Asn 
310 

Ala Ala Glu 
325 

Cys Glu Ala 
340 

Val Ala Ala 

355 

Glu Leu Glu 
370 

Val Pro Met 
385 

Glu Glu Pro 

400 

Ala Ser Tyr 
415 

Leu Ser Ser 
430 

Pro Pro Gly 

445 

Leu Pro Ser 
460 

Ser Asp Phe 
475 

Qly Leu Ser 

490 

Pro Gly Arg 
505 

Thr Ser Val 
520 

Gin Thr His 
535 

Gly Asp Ser 
550 

Phe He Leu 

565 

Glu He He 
580 

Pro Met Glu 
595 

Ala Pro Leu 

610 

Glu Glu Leu 
625 

Ala Phe Ser 
640 

Gly Asn Pro 



240 

Cys He Arg 
255 

Leu Ala His 

270 

Leu Ala Glu 
285 

Leu Arg Met 
300 

Gin Ala He 

315 

Glu Val Ala 
330 

Asp Tyr Gin 
345 

He Gin Arg 

360 

Ala Leu Pro 
375 

Ala Pro Gly 
390 

Glu Ala Asp 

405 

Ser Ser Thr 
420 

Ser Gly Phe 
435 

Qly Val Pro 

450 

Ala Phe Ala 
465 

Ser Pro Gly 
480 

Asp Val Gly 

495 

Asn Leu Arg 
510 

Ser Asp Gin 
525 

Asn Ser Lys 

540 

Pro Glu Glu 
555 

Pro Ser Glu 

570 

Gin Arg Val 
585 

Ala Ala Glu 
600 

Pro Ala Leu 

615 

Gin Ser Ser 
630 

Thr Ser Ser 
645 

Phe Ser Pro 
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650 




655 


660 


Gly Thr Pro lie Ser Pro Qly Pro 


lie Phe Pro 


lie Thr Ser Pro 


665 




670 


^ ^ ^ 

675 


Pro Cys His Pro Ser 


Pro Ser Pro 


Phe Ser Pro 


lie Ser Ser Gin 


680 




685 


690 


Val Ser Ser Asn Pro 


Ser Pro His 


Pro Thr Ser 


Ser Pro Leu Pro 


695 




700 


705 


Phe Ser Ser Ser Thr 


Pro Glu Phe 


Pro Val Pro 


Leu Ser Gin Cys 


710 




715 


720 


Pro Trp Ser Ser Leu 


Pro Thr Thr 


Ser Pro Pro 


Thr Phe Ser Pro 


725 




730 


735 


Thr Cys Ser Gin Val 


Thr Leu Ser 


Ser Pro Phe 


Phe Pro Pro Cys 


740 




745 


750 


Pro Ser Thr Ser Ser 


Phe Pro Ser 


Thr Thr Ala 


Ala Pro Leu Leu 


755 




760 


765 


Ser Leu Ala Ser Ala 


Phe Ser Leu 


Ala Val Met 


Thr Val Ala Gin 


770 




775 


780 


Ser Leu Leu Ser Pro 


Ser Pro Gly Leu Leu Ser 


Gin Ser Pro Pro 


785 




790 


795 


Ala Pro Pro Ser Pro 


Leu Pro Ser 


Leu Pro Leu 


Pro Pro Pro Val 


800 




805 


810 


Ala Pro Gly Qly Gin Qlu Ser Pro 


Ser Pro His 


Thr Ala Glu Val 


815 




820 


825 


Glu Ser Glu Ala Ser 


Pro Pro Pro Ala Arg Pro Leu Pro Gly Qlu 


830 




835 


840 


Ala Arg Leu Ala Pro 


lie Ser Glu Glu Gly Lys 


Pro Gin Leu Val 


845 




850 


855 


Qly Arg Phe Gin Val 


Thr Ser Ser 


Lys Glu Pro 


Ala Qlu Pro Leu 


860 




865 


870 


Pro Leu Gin Pro Thr 


Ser Pro Thr 


Leu Ser Gly Ser Pro Lys Pro 


875 




880 


885 


Ser Thr Pro Gin Leu 


Thr Ser Glu 


Ser Ser Asp 


Thr Glu Asp Ser 


890 




895 


900 


Ala Gly Gly Gly Pro Glu Thr Arg Glu Ala Leu Ala Glu Ser Asp 


905 




910 


915 


Arg Ala Ala Glu Gly Leu Gly Ala Gly Val Glu Glu Glu Gly Asp 


920 




925 


930 


Asp Gly Lys Glu Pro Gin Val Gly Gly Ser Pro 


Gin Pro Leu Ser 


935 




940 


945 


His Pro Ser Pro Val 


Trp Met Asn Tyr Ser Tyr 


Ser Ser Leu Cys 


950 




955 


960 


Leu Ser Ser Glu Glu Ser Glu Ser Ser Gly Glu Asp Glu Glu Phe 


965 




970 


975 


Trp Ala Glu Leu Gin 


Ser Leu Arg 


Gin Lys His 


Leu Ser Glu Val 


980 




985 


990 


Glu Thr Leu Gin Thr 


Leu Gin Lys 


Lys Glu lie 


Glu Asp Leu Tyr 


995 




1000 


1005 


Ser Arg Leu Qly Lys Gin Pro Pro 


Pro Gly lie Val Ala Pro Ala 


1010 




1015 


1020 


Ala Met Leu Ser Ser Arg Gin Arg Arg Leu Ser Lys Gly Ser Phe 


1025 




1030 


1035 


Pro Thr Ser Arg Arg 


Asn Ser Leu 


Gin Arg Ser 


Qlu Pro Pro Gly 


1040 




1045 


1050 


Pro Gly lie Met Arg 


Arg Asn Ser 


Leu Ser Gly 


Ser Ser Thr Gly 


1055 




1060 


1065 


Ser Qln Glu Gin Arg Ala Ser Lys Gly Val Thr Phe Ala Qly Asp 
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1070 



1075 



1080 



Val Gly Arg Met 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: 7474648CD1 



<400> 18 
Met Gly Olu 
1 

Thr Glu Asn 
Asp Asn Glu 
Gly Pro Val 
Glu lie Pro 
Gly Cys Gin 
Glu Thr His 
Trp Glu Leu 
Pro Gly Val 
Asp His Asp 
Arg Ser Cys 
Ser Arg Val 
Gly Val Arg 
Gly Leu Gly 
Arg Gly Ala 
Thr Gly Gly 
Gin Cys Pro 
Gin Arg Met 
Arg Asn Ser 
Thr Gly Ser 
Lys Lys Pro 



Ser Gly Asn 
5 

Gin Thr Ala 
20 

Asn lie Trp 

35 

Met Gly Trp 
50 

Asn Val Gly 
65 

Leu Pro Cys 

80 

Val Val Arg 
95 

Arg Pro Ala 

110 
Ser Val Val 

125 

Pro Pro Val 

140 
Ser Leu lie 

155 
Pro Arg Ser 

170 

Arg Gin Gin 

185 
Gin Ala Arg 

200 
Trp Ala Arg 

215 
Ser Leu Pro 

230 
Val Oln Phe 

245 
Gly Lys Lys 

260 
Tyr Leu Arg 

275 
Phe Ser Arg 

290 
Phe Ala lie 

305 



His His Phe 
His Val Leu 
Ala Gin Gly 
Lys Ala Arg 
Thr Leu Thr 
Cys Lys Asp 
Ser Gly Gly 
Asp Ser His 
Ser Gly Tyr 
Gly Phe Leu 
Lys Gly Gly 
Arg Glu Arg 
Glu Gly Gly 
Arg Ser Lys 
Gly Gly Gly 
Ser Gin Val 
Leu Phe Asp 
Arg Ala Gly 
Arg Tyr Asp 
Val Val Arg 
Lys Val Met 



Gin Gin Thr 
10 

Thr His Lys 
25 

Gly Glu His 
40 

Ser Gly Lys 
55 

Leu Leu Thr 
70 

Thr Gin Ala 

85 

Leu Leu Pro 
100 

Thr Val Thr 
115 

Pro Gly Gly 

130 

Ser Glu Gly 
145 

Gly Thr Gly 
160 

Arg Ala Cys 

175 

Pro Gly Ala 
190 

Pro Ser Arg 
205 

Pro Gly Gly 
220 

Arg Pro Pro 
235 

lie Ser Glu 
250 

Ala Ala Ala 
265 

lie Lys Ala 
280 

Val Glu Gin 

295 

Glu Thr Arg 
310 



Asn Thr Gly 
15 

Trp Glu Leu 
30 

His Lys Leu 
45 

Thr Leu Gly 
60 

Gly Tyr Gly 
75 

Ala Tyr Gly 

90 

Thr Ala Ser 
105 

Ser Asp Asp 
120 

Cys Leu Pro 
135 

Pro Ala Pro 
150 

Leu Ala Ala 
165 

Cys Gly Tyr 
180 

Thr Ser Ala 
195 

Arg Arg Arg 
210 

Ala Glu Asp 
225 

Gly Pro Cys 
240 

Gin Gly Val 
255 

Asn Lys Gly 
270 

Leu lie Gly 
285 

Lys Thr Thr 

300 

Olu Arg Glu 
315 
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Gly Arg Glu 
Ser His Arg 
Asp Gin Val 
Phe Asp Arg 
Val Arg He 
Ala Leu Gin 
Tyr Tyr His 
Gly Leu Ala 
Thr Leu Cys 
Arg Lys Pro 
He Thr Tyr 
Ser Gin Thr 
Tyr Thr Gly 
Phe He Asp 
Ala Gly Gin 
Gly Ser Ser 
Met Gin Arg 
Ser Ser Lys 
Lys Arg Asn 



Ala Cys Val 

320 
Tyr He Val 

335 
Tyr Met Val 

350 
Leu He Ala 

365 
Leu Gin Met 

380 
He Thr His 

395 
Pro Gly Glu 

410 
Tyr Ser Gly 

425 
Gly Thr Pro 

440 
Tyr Thr Ser 

455 
Ala Leu Leu 

470 
Arg Leu Tyr 

485 
Glu Pro Trp 

500 
Lys Leu Leu 

515 
Ala Leu Asp 

530 
Met Lys Asn 

545 
Ala Ser Pro 

560 
Ser His Tyr 

575 
Leu Arg He 

590 



Ser Glu Leu 
Gin Leu Met 
Met Glu Leu 
Gin Gly Ser 
Val Ala Asp 
Arg Asn Leu 
Glu Ser Lys 
Lys Lys Ser 
Qlu Tyr He 
Ala Val Asp 
Ser Gly Phe 
Arg Lys He 
Pro Ser He 
He Leu Glu 
His Pro Trp 
Leu Gin Arg 
His Ser Gin 
Ser His Lys 
Val Glu Ser 



Ser Val Leu 

325 

Glu He Phe 
340 

Ala Thr Gly 
355 

Phe Thr Glu 
370 

Gly He Arg 
385 

Lys Pro Glu 
400 

He Leu He 
415 

Gly Asp Trp 
430 

Ala Pro Glu 
445 

Met Trp Ala 
460 

Leu Pro Phe 
475 

Leu Lys Gly 
490 

Ser His Leu 
505 

Ala Gly His 
520 

Val He Thr 
535 

Ala He Ser 
550 

Ser Pro Gly 
565 

Ser Arg His 
580 

Pro Leu Ser 
595 



Arg Arg Val 

330 

Glu Thr Glu 
345 

Gly Glu Leu 
360 

Arg Asp Ala 

375 

Tyr Leu His 
390 

Asn Leu Leu 
405 

Thr Asp Phe 
420 

Thr Met Lys 
435 

Val Leu Leu 
450 

Leu Gly Val 
465 

Asp Asp Glu 
480 

Lys Tyr Asn 
495 

Ala Lys Asp 

510 

Arg Met Ser 
525 

Met Ala Ala 
540 

Arg Asn Leu 
555 

Ser Ala Gin 
570 

Met Trp Ser 
585 

Ala Leu Leu 
600 



<210> 19 

<211> 1114 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7483053CD1 

<400> 19 

Met Ala Lys Ala Thr Ser Gly Ala Ala Gly Leu Arg Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Pro Leu Leu Gly Lys Val Ala Leu Gly Leu Tyr 

20 25 30 

Phe Ser Arg Asp Ala Tyr Trp Glu Lys Leu Tyr Val Asp Gin Ala 

35 40 45 
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Ala Gly Thr 
Qlu Glu Val 
Tyr Arg Thr 
Asp Thr Gly 
Trp Glu Lys 
Val Tyr Leu 
Glu Cys Gin 
Asn Thr Ser 
Cys Phe Pro 
Pro Pro Gly 
Leu Cys Pro 
Qly Leu Pro 
Arg Trp Ala 
Ala Val Cys 
Val Pro Phe 
Thr Phe Pro 
Lys Arg Lys 
Ala Asp Val 
Ser Thr Leu 
Val Olu His 
Phe Val Arg 
Asn Leu Ser 
Val Asn Asp 
Leu His Phe 
Ser Thr Tyr 
Gin He Gly 
He Asn Val 
Thr Leu Gly 




Pro Leu Leu 

50 

Pro Ser Phe 
65 

Arg Leu His 
80 

Leu Leu Tyr 
95 

Leu Ser Val 

110 
Lys Val Phe 

125 
Trp Pro Gly 

140 
Phe Pro Ala 

155 
Glu Thr Arg 

170 
Thr Phe His 

185 
Asn He Ser 

200 
Phe Arg Cys 

215 
Leu Asp Arg 

230 
Thr Val His 

245 
Pro Val Thr 

260 
Ala Gly Val 

275 

Glu Asp Thr 

290 
Val Pro Ala 

305 
Leu Pro Gly 

320 
Trp Pro Asn 

335 
Ala Thr Val 

350 
He Ser Glu 

365 
Ser Asp Phe 

380 
Asn Val Ser 

395 
Ser Leu Ser 

410 
Lys Val Cys 

425 
Gin Tyr Lys 

440 
Val Val Thr 

455 



Tyr Val His 
Arg Leu Qly 
Glu Asn Asn 
Leu Asn Arg 
Arg Asn Arg 
Leu Ser Pro 
Cys Ala Arg 
Cys Ser Ser 
Pro Ser Phe 
Gin Phe Arg 
Val Ala Tyr 
Ala Pro Asp 
Glu Gin Arg 
Ala Gly Ala 
Val Tyr Asp 
Asp Thr Ala 
Val Val Ala 
Ser Gly Glu 
Asp Thr Trp 
Glu Thr Ser 
His Asp Tyr 
Asn Arg Thr 
Gin Gly Pro 
Val Leu Pro 
Val Ser Arg 
Val Glu Asn 
Leu His Ser 
Ser Ala Glu 



Ala Leu Arg 

55 

Gin His Leu 
70 

Trp He Cys 
85 

Ser Leu Asp 
100 

Gly Phe Pro 
115 

Thr Ser Leu 
130 

Val Tyr Phe 
145 

Leu Lys Pro 
160 

Arg He Arg 
175 

Leu Leu Pro 

190 

Arg Leu Leu 
205 

Ser Leu Glu 
220 

Glu Lys Tyr 

235 

Arg Glu Glu 
250 

Olu Asp Asp 
265 

Ser Ala Val 

280 

Thr Leu Arg 
295 

Leu Val Arg 
310 

Ala Gin Gin 

325 

Val Gin Ala 
340 

Arg Leu Val 
355 

Met Gin Leu 
370 

Gly Ala Gly 
385 

Val Ser Leu 
400 

Arg Ala Arg 

415 

Cys Gin Ala 
430 

Ser Qly Ala 
445 

Asp Thr Ser 
460 
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Asp Ala Pro 

60 

Tyr Gly Thr 
75 

He Gin Glu 
90 

His Ser Ser 
105 

Leu Leu Thr 
120 

Arg Glu Gly 
135 

Ser Phe Phe 

150 

Arg Glu Leu 
165 

Glu Asn Arg 
180 

Val Gin Phe 

195 

Glu Gly Glu 
210 

Val Ser Thr 
225 

Glu Leu Val 
240 

Val Val Met 
255 

Ser Ala Pro 
270 

Val Glu Phe 

285 

Val Phe Asp 
300 

Arg Tyr Thr 
315 

Thr Phe Arg 

330 

Asn Gly Ser 
345 

Leu Asn Arg 
360 

Ala Val Leu 

375 

Val Leu Leu 
390 

His Leu Pro 
405 

Arg Phe Ala 

420 

Phe Ser Gly 
435 

Asn Cys Ser 
450 

Gly He Leu 
465 
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Phe Val Asn Asp 
Tyr Met 
Ala Oln 



Leu His 
Ala Gin 
Glu Glu 
Leu Glu 



Ala Gly 
Cys Glu 



Cys Glu 
Ser Thr 



Trp Arg 
Cys Ser 
Asp Val Val Glu 



Leu Arg 
Gly He 
Glu Lys 
Asp Glu 
Phe He 



Gly Ser 
Lys Ala 
Cys Phe 
Leu Cys 
Val Ser 



His Lys Phe Ala 
Phe Arg 
Ser Ala 



Val Asp 
Arg Lys 
Gly Lys 
Gly Tyr 
Pro Ser 
Gin Val 
Gin Asp 
Ser Leu 



Arg Pro 
Arg Arg 
Ala Phe 



Tyr Leu 
Pro Asp 
Trp Gin 

Val His Arg Asp 



Asn Leu 
Val Val 
Thr Thr 
Glu Leu 
Asn His 
Gly Pro 
Arg Gly 
Gly Ser 
Glu Arg 
He Ser 



Thr 
470 
Val 
485 
Leu 
500 
Cys 
515 
Glu 
530 
Gin 
545 
Pro 
560 
Thr 
575 
He 
590 
Gly 
605 
Cys 
620 
Arg 
635 
Val 
650 
His 
665 
Ala 
680 
Pro 
695 
Lys 
710 
Val 
725 
Lys 
740 
Val 
755 
Arg 
770 
Pro 
785 
Leu 
800 
Phe 
815 
Gly 
830 
Ala 
845 
Gin 
860 
Leu 
875 



Lys 

Val 

Leu 

Pro 

Cys 

Gly 

Ser 

Gin 

Val 

Tyr 

Glu 

Thr 

Leu 

Lys 

Gin 

Ser 

He 

Leu 

Ala 

Ala 

Asp 

His 



Ala Leu Arg 
Ala Thr Asp 
Val Thr Val 
Leu Ser Cys 
Gly Gly Leu 
Asp Gly Lys 
Thr Lys Thr 
Asp He Asn 
Gly Gly His 
Gly Thr Cys 
Pro Glu Asp 
Val He Ala 
Leu Ser Ala 
Pro Pro He 
Ala Phe Pro 
Leu Asp Ser 
Leu Glu Asp 
Gly Lys Thr 
Thr Ala Phe 
Val Lys Met 
Leu Leu Ser 
Val He Lys 
Leu He Val 



Leu 

Leu Arg Glu Ser 
Gly 
Leu 



Gly 
Ala 



Ser Arg Asn 
Thr Met Gly 
Met Gin Tyr 
Ala Arg Asn 



Arg Pro 
475 

Gin Gin 
490 

Glu Gly 
505 

Ala Val 
520 

Gly Ser 

535 

Gly He 
550 

Cys Pro 
565 

He Cys 

580 

Glu Pro 
595 

Asn Cys 
610 

He Gin 

625 

Ala Ala 
640 

Phe Cys 

655 

Ser Ser 
670 

Val Ser 
685 

Met Glu 
700 

Pro Lys 
715 

Leu Gly 
730 

His Leu 
745 

Leu Lys 
760 

Glu Phe 
775 

Leu Tyr 
790 

Glu Tyr 
805 

Arg Lys 
820 

Ser Ser 
835 

Asp Leu 
850 

Leu Ala 

865 

He Leu 
880 



Lys Cys 
Thr Ser 
Ser Tyr 
Ser Lys 
Pro Thr 
Thr Arg 
Asp Gly 
Pro Gin 
Gly Glu 
Phe Pro 
Asp Pro 
Val Leu 
He His 
Ala Glu 
Tyr Ser 
Asn Gin 
Trp Glu 
Glu Gly 
Lys Gly 
Glu Asn 
Asn Val 
Gly Ala 
Ala Lys 
Val Gly 
Ser Leu 
He Ser 
Glu Met 
Val Ala 



Ala Glu 
480 
Arg Gin 

495 
Val Ala 

510 
Arg Arg 

525 
Gly Arg 

540 
Asn Phe 

555 
His Cys 

570 
Asp Cys 

585 
Pro Arg 

600 
Glu Glu 

615 
Leu Cys 

630 
Phe Ser 

645 
Cys Tyr 

660 
Met Thr 

675 
Ser Ser 

690 
Val Ser 

705 
Phe Pro 

720 
Glu Phe 

735 
Arg Ala 

750 
Ala Ser 

765 
Leu Lys 

780 
Cys Ser 

795 
Tyr Gly 

810 
Pro Gly 

825 
Asp His 

840 
Phe Ala 

855 
Lys Leu 

870 
Glu Gly 

885 
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Arg Lys Met Lys lie Ser Asp Phe Qly Leu Ser Arg Asp Val Tyr 

890 895 900 

Glu Glu Asp Ser Tyr Val Lys Arg Ser Gin Gly Arg lie Pro Val 

905 910 915 

Lys Trp Met Ala lie Glu Ser Leu Phe Asp His lie Tyr Thr Thr 

920 925 930 

Gin Ser Asp Val Trp Ser Phe Gly Val Leu Leu Trp Glu lie Val 

935 940 945 

Thr Leu Gly Gly Asn Pro Tyr Pro Gly lie Pro Pro Glu Arg Leu 

950 955 960 

Phe Asn Leu Leu Lys Thr Gly His Arg Met Glu Arg Pro Asp Asn 

965 970 975 

Cys Ser Glu Glu Met Tyr Arg Leu Met Leu Gin Cys Trp Lys Gin 

980 985 990 

Glu Pro Asp Lys Arg Pro Val Phe Ala Asp lie Ser Lys Asp Leu 

995 1000 1005 

Glu Lys Met Met Val Lys Arg Arg Asp Tyr Leu Asp Leu Ala Ala 

1010 1015 1020 

Ser Thr Pro Ser Asp Ser Leu lie Tyr Asp Asp Gly Leu Ser Glu 

1025 1030 1035 

Glu Glu Thr Pro Leu Val Asp Cys Asn Asn Ala Pro Leu Pro Arg 

1040 1045 1050 

Ala Leu Pro Ser Thr Trp lie Glu Asn Lys Leu Tyr Gly Met Ser 

1055 1060 1065 

Asp Pro Asn Trp Pro Gly Glu Ser Pro Val Pro Leu Thr Arg Ala 

1070 1075 1080 

Asp Gly Thr Asn Thr Gly Phe Pro Arg Tyr Pro Asn Asp Ser Val 

1085 1090 1095 

Tyr Ala Asn Trp Met Leu Ser Pro Ser Ala Ala Lys Leu Met Asp 

1100 1105 1110 

Thr Phe Asp Ser 



<210> 20 

<211> 567 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 7483117CD1 

<400> 20 

Met Asp Asp Lys Asp lie Asp Lys Glu Leu Arg Gin Lys Leu Asn 
15 10 15 

Phe Ser Tyr Cys Glu Glu Thr Glu lie Glu Gly Gin Lys Lys Val 

20 25 30 

Glu Glu Ser Arg Glu Ala Ser Ser Gin Thr Pro Glu Lys Gly Glu 

35 40 45 

Val Gin Asp Ser Glu Ala Lys Gly Thr Pro Pro Trp Thr Pro Leu 

50 55 60 

Ser Asn Val His Glu Leu Asp Thr Ser Ser Glu Lys Asp Lys Glu 

65 70 75 

Ser Pro Asp Gin lie Leu Arg Thr Pro Val Ser His Pro Leu Lys 

80 85 90 

Cys Pro Glu Thr Pro Ala Gin Pro Asp Ser Arg Ser Lys Leu Leu 
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Pro Ser Asp 
Val He Ser 
Leu Lys Leu 
Ala Leu Val 
Leu Phe Leu 
Olu Glu Ala 
Arg Cys Val 
Glu Phe Leu 
Val Tyr Lys 
Lys Arg Ser 
Ala Leu His 
His Val Val 
He He Gin 
He Ser Glu 
Leu Lys Asp 
His Asn Ser 
Phe He Cys 
Glu Val Glu 
Tyr Lys He 
Lys Val Glu 
Gin Glu Asp 
Gly Leu Thr 
Asn Gly Ala 
Val Pro Gin 
Met He Gin 
Ala Arg Asn 
Leu Gin Gin 
Glu Arg Glu 



95 

Ser Pro Ser 

110 
Pro Thr Gly 

125 
Thr Pro Ala 

140 
Asn He Asn 

155 
Gin Ser Gly 

170 
Gly Pro Glu 

185 
Leu Arg Glu 

200 
Glu Val Glu 

215 
Cys He Lys 

230 
Met Lys Thr 

245 
Glu Val Tyr 

260 
Arg Tyr Tyr 

275 
Asn Glu Tyr 

290 
Asn Thr Lys 

305 
He Leu Leu 

320 
Ser Met Val 

335 
His Lys Met 

350 
Asn Glu Ala 

365 
Gly Asp Leu 

380 
Glu Gly Asp 

395 
Tyr Arg His 

410 
He Ala Val 

425 
Ala Trp His 

440 
Glu Leu Ser 

455 
Pro Asp Ala 

470 
Thr Val Leu 

485 
Gin Leu Asn 

500 
Leu Arg Glu 



Thr Pro Lys 
Lys Leu Pro 
Pro Leu Lys 
Pro Phe Thr 
Gly Lys Arg 
Glu Gly Lys 
Thr Asn Met 
Lys He Gly 
Arg Leu Asp 
Phe Thr Glu 
Ala His Ala 
Ser Ser Trp 
Cys Asn Gly 
Ser Gly Asn 
Gin He Ser 
His Leu Asp 
Gin Ser Glu 
Asp Trp Phe 
Gly His Ala 
Ser Arg Phe 
Leu Pro Lys 
Ala Ala Gly 
His He Arg 
Glu Ser Phe 
Glu Gin Arg 
Arg Pro Ser 
Leu Glu Lys 
Ala Gin Gin 



100 

Thr Met Leu 
115 

Ser Arg Gly 
130 

Asp Glu Met 
145 

Pro Glu Ser 
160 

Lys He Arg 
175 

Gly Gly Leu 
190 

Ala Ser Arg 
205 

Val Gly Glu 
220 

Gly Cys Val 
235 

Leu Ser Asn 
250 

Val Leu Gly 
265 

Ala Glu Asp 
280 

Gly Ser Leu 
295 

His Phe Glu 

310 

Leu Gly Leu 
325 

He Lys Pro 
340 

Ser Ser Gly 

355 

Leu Ser Ala 
370 

Thr Ser He 
385 

Leu Ala Asn 
400 

Ala Asp He 
415 

Ala Glu Ser 

430 

Lys Gly Asn 
445 

Ser Ser Leu 
460 

Pro Ser Ala 

475 

Leu Gly Lys 
490 

Phe Lys Thr 
505 

Ala Gin Ser 



105 

Ser Arg Leu 
120 

Pro Lys His 
135 

Thr Ser Leu 
150 

Tyr Lys Lys 
165 

Gly Asp Leu 
180 

Pro Ala Lys 
195 

Tyr Glu Lys 
210 

Phe Gly Thr 
225 

Tyr Ala He 
240 

Glu Asn Ser 
255 

His His Pro 
270 

Asp His Met 
285 

Gin Ala Ala 
300 

Glu Pro Lys 
315 

Asn Tyr He 
330 

Ser Asn He 
345 

Val He Glu 

360 

Asn Val Met 
375 

Asn Lys Pro 
390 

Glu He Leu 
405 

Phe Ala Leu 
420 

Leu Pro Thr 

435 

Phe Pro Asp 
450 

Leu Lys Asn 
465 

Ala Ala Leu 

480 

Thr Glu Glu 
495 

Ala Thr Leu 
510 

Pro Gin Gly 
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Tyr Thr His 
Ser Arg Ser 
Ser Ser Phe 




515 
His Gly Asp 

530 
Thr Lys Arg 

545 

Thr Ser Oly 
560 



Thr Gly Val 
Leu Val Gly 
Olu Arg Glu 



520 

Ser Gly Thr 
535 

Oly Lys Ser 

550 

Pro Leu His 
565 
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525 

His Thr Gly 
540 

Ala Arg Ser 

555 



<210> 21 
<211> 2054 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> xnisc_£eature 

<223> Incyte ID No: 7484498CD1 



<400> 21 
Met Leu Lys 
1 

Ala Ala Glu 
Phe Gin Gly 
Leu Ser Arg 
Glu Cys Ser 
Phe Val Arg 
Gin Pro Ser 
Gly His Phe 
Asp lie Tyr 
Gin Glu Gin 
Arg Ser Thr 
Asp Lys Asn 
Asp Leu Leu 
Asn Leu lie 
Ser Val His 
Asn lie Leu 
Gly Ser Ala 
Leu Pro lie 
Val Met Asn 



Phe Lys Tyr 
5 

Pro lie Ala 
20 

Lys Pro Pro 
35 

Glu Gly He 

50 

Gin Pro Ala 
65 

Lys Tyr Ser 
80 

Ala Lys Asp 

95 

Ala Glu Val 

110 
Ala Met Lys 

125 
Val Ser Phe 

140 
Ser Pro Trp 

155 
His Leu Tyr 

170 
Ser Leu Leu 

185 

Gin Phe Tyr 

200 
Leu Met Gly 

215 
Val Asp Arg 

230 
Ala Lys Met 

245 
Gly Thr Pro 

260 
Gly Asp Gly 

275 



Gly Ala Arg 
Ser Arg Ala 
Phe Met Thr 
Leu Asp Ala 
Leu Met Lys 
Asp Thr He 
Phe Glu Val 
Gin Val Val 
Val Met Lys 
Phe Glu Glu 
He Pro Gin 
Leu Val Met 
Asn Arg Tyr 
Leu Ala Glu 
Tyr Val His 
Thr Gly His 
Asn Ser Asn 
Asp Tyr Met 
Lys Gly Thr 



Asn Pro Leu 
10 

Ser Arg Leu 
25 

Gin Gin Gin 
40 

Leu Phe Val 

55 

He Lys His 
70 

Ala Glu Leu 
85 

Arg Ser Leu 
100 

Arg Glu Lys 
115 

Lys Lys Ala 
130 

Glu Arg Asn 

145 

Leu Gin Tyr 
160 

Glu Tyr Gin 
175 

Glu Asp Gin 

190 

Leu He Leu 
205 

Arg Asp He 
220 

He Lys Leu 
235 

Lys Met Val 
250 

Ala Pro Glu 
265 

Tyr Gly Leu 
280 



Asp Ala Gly 
15 

Asn Leu Phe 
30 

Met Ser Pro 
45 

Leu Phe Glu 

60 

Val Ser Asn 
75 

Gin Glu Leu 
90 

Val Gly Cys 

105 

Ala Thr Gly 
120 

Leu Leu Ala 
135 

He Leu Ser 
150 

Ala Phe Gin 
165 

Pro Gly Gly 
180 

Leu Asp Glu 

195 

Ala Val His 
210 

Lys Pro Glu 
225 

Val Asp Phe 
240 

Asn Ala Lys 
255 

Val Leu Thr 
270 

Asp Cys Asp 

285 
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Trp Trp Ser 
Ser Pro Phe 
Met Asn Phe 
Ser Ser Asp 
Lys Glu Arg 
Ser Lys lie 
Val Pro Thr 
Pro Glu Lys 
Pro Ser Gly 
Tyr Ser Lys 
Ser Gly Leu 
Leu Leu lie 
His Lys Met 
Glu Val Glu 
Ser Glu Thr 
He Thr Glu 
Met Glu Val 
Asp lie Arg 
Glu Tyr Gin 
Leu Glu Glu 
Glu Ser Glu 
Arg Lys Ala 
Gin Gly Lys 
Asn Ala Glu 
Glu Lys Ala 
Asn He Arg 
Leu Gin Asn 
Val Glu Ala 



Val Gly Val 

290 
Ala Glu Gly 

305 
Gin Arg Phe 

320 
Phe Leu Asp 

335 
Leu Lys Phe 

350 
Asp Trp Asn 

365 
Leu Lys Ser 

380 
Asn Ser Trp 

395 
Phe Ser Gly 

410 
Ala Leu Gly 

425 
Asp Ser Pro 

440 
Lys Ser Lys 

455 
Glu Gin Glu 

470 

« 

Ala Val Leu 

485 
Gin Arg Ser 

500 
Cys Ser Ser 

515 
Ser Gin Glu 

530 
Glu Gin Ser 

545 
Ala Gin Val 

560 
Asp Leu Val 

575 
Leu Arg Glu 

590 
Thr Glu Cys 

605 
Pro Glu Val 

620 
Gin Gin Leu 

635 
Val Lys Ala 

650 
Gin Ala Lys 

665 
Arg Glu Asp 

680 
Glu Glu Arg 

695 



He Ala Tyr 
Thr Ser Ala 
Leu Lys Phe 
Leu lie Gin 
Glu Gly Leu 
Asn He Arg 
Asp Asp Asp 
Val Ser Ser 
Glu Glu Leu 
He Leu Gly 
Ala Lys Thr 
Glu Leu Gin 
Met Thr Arg 
Ser Gin Lys 
Leu Leu Glu 
Leu Lys Arg 
Asp Asp Lys 
Arg Lys Leu 
Glu Glu Met 
Ser Ala Arg 
Ser Arg Leu 
Gin His Lys 
Gly Glu Tyr 
Lys He Gin 
Ser Thr Glu 
Glu Arg Ala 
Ser Ser Glu 
Arg His Ser 



Glu Met He 

295 

Arg Thr Phe 
310 

Pro Asp Asp 
325 

Ser Leu Leu 
340 

Cys Cys His 
355 

Asn Ser Pro 
370 

Thr Ser Asn 
385 

Ser Pro Cys 
400 

Pro Phe Val 
415 

Arg Ser Glu 
430 

Ser Ser Met 
445 

Asp Ser Gin 
460 

Leu His Arg 
475 

Glu Val Glu 
490 

Gin Asp Leu 
505 

Ser Leu Glu 

520 

Ala Leu Gin 
535 

Gin Glu He 
550 

Arg Leu Met 
565 

Arg Arg Ser 
580 

Ala Ala Glu 

595 

Leu Leu Lys 
610 

Ala Lys Leu 
625 

Glu Leu Gin 
640 

Ala Thr Glu 
655 

Glu Arg Glu 
670 

Gly He Arg 
685 

Leu Glu Asn 
700 



Tyr Gly Arg 

300 

Asn Asn He 
315 

Pro Lys Val 
330 

Cys Gly Gin 
345 

Pro Phe Phe 
360 

Pro Pro Phe 
375 

Phe Asp Glu 

390 

Gin Leu Ser 
405 

Gly Phe Ser 
420 

Ser Val Val 
435 

Glu Lys Lys 
450 

Asp Lys Cys 
465 

Arg Val Ser 
480 

Leu Lys Ala 
495 

Ala Thr Tyr 
510 

Gin Ala Arg 
525 

Leu Leu His 
540 

Lys Glu Gin 

555 

Met Asn Gin 
570 

Asp Leu Tyr 
585 

Glu Phe Lys 

600 

Ala Lys Asp 
615 

Glu Lys He 
630 

Glu Lys Leu 

645 

Leu Leu Gin 
660 

Leu Glu Lys 
675 

Lys Lys Leu 

690 

Lys Val Lys 
705 
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Arg Leu Glu Thr Met Qlu Arg Arg Glu Asn Arg Leu Lys Asp Asp 

710 715 720 

lie Oln Thr Lys Ser Oln Qln lie Gin Oln Met Ala Asp Lys lie 

725 730 735 

Leu Glu Leu Glu Glu Lys His Arg Glu Ala Gin Val Ser Ala Oln 

740 745 750 

His Leu Glu Val His Leu Lys Gin Lys Glu Gin His Tyr Glu Glu 

755 760 765 

Lys lie Lys Val Leu Asp Asn Gin lie Lys Lys Asp Leu Ala Asp 

770 775 780 

Lys Glu Thr Leu Glu Asn Met Met Gin Arg His Glu Glu Glu Ala 

785 790 795 

His Glu Lys Gly Lys lie Leu Ser Glu Gin Lys Ala Met lie Asn 

800 805 810 

Ala Met Asp Ser Lys lie Arg Ser Leu Glu Gin Arg lie Val Glu 

815 820 825 

Leu Ser Glu Ala Asn Lys Leu Ala Ala Asn Ser Ser Leu Phe Thr 

830 835 840 

Gin Arg Asn Met Lys Ala Gin Glu Glu Met lie Ser Glu Leu Arg 

845 850 855 

Gin Gin Lys Phe Tyr Leu Qlu Thr Gin Ala Gly Lys Leu Glu Ala 

860 865 870 

Gin Asn Arg Lys Leu Glu Glu Gin Leu Glu Lys lie Ser His Oln 

875 880 885 

Asp His Ser Asp Lys Asn Arg Leu Leu Glu Leu Glu Thr Arg Leu 

890 895 900 

Arg Glu Val Ser Leu Glu His Glu Glu Gin Lys Leu Glu Leu Lys 

905 910 915 

Arg Gin Leu Thr Glu Leu Gin Leu Ser Leu Gin Glu Arg Glu Ser 

920 925 930 

Gin Leu Thr Ala Leu Gin Ala Ala Arg Ala Ala Leu Glu Ser Gin 

935 940 945 

Leu Arg Gin Ala Lys Thr Glu Leu Glu Glu Thr Thr Ala Glu Ala 

950 955 960 

Glu Glu Glu lie Oln Ala Leu Thr Ala His Arg Asp Glu lie Gin 

965 970 975 

Arg Lys Phe Asp Ala Leu Arg Asn Ser Cys Thr Val lie Thr Asp 

980 985 990 

Leu Glu Glu Gin Leu Asn Gin Leu Thr Glu Asp Asn Ala Glu Leu 

995 1000 1005 

Asn Asn Oln Asn Phe Tyr Leu Ser Lys Oln Leu Asp Glu Ala Ser 

1010 1015 1020 

Gly Ala Asn Asp Glu lie Val Oln Leu Arg Ser Glu Val Asp His 

1025 1030 1035 

Leu Arg Arg Glu lie Thr Glu Arg Glu Met Gin Leu Thr Ser Gin 

1040 1045 1050 

Lys Oln Thr Met Glu Ala Leu Lys Thr Thr Cys Thr Met Leu Glu 

1055 1060 1065 

Glu Gin Val Met Asp Leu Glu Ala Leu Asn Asp Glu Leu Leu Qlu 

1070 1075 1080 

Lys Glu Arg Gin Trp Glu Ala Trp Arg Ser Val Leu Gly Asp Glu 

1085 1090 1095 

Lys Ser Oln Phe Glu Cys Arg Val Arg Glu Leu Oln Arg Met Leu 

1100 1105 1110 

Asp Thr Glu Lys Oln Ser Arg Ala Arg Ala Asp Oln Arg lie Thr 

1115 1120 1125 
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Glu Ser Arg Gin Val Val Glu Leu Ala Val Lys Glu His Lys Ala 

1130 1135 1140 

Glu lie Leu Ala Leu Gin Gin Ala Leu Lys Glu Gin Lys Leu Lys 

1145 1150 1155 

Ala Glu Ser Leu Ser Asp Lys Leu Asn Asp Leu Glu Lys Lys His 

1160 1165 1170 

Ala Met Leu Glu Met Asn Ala Arg Ser Leu Gin Gin Lys Leu Glu 

1175 1180 1185 

Thr Glu Arg Glu Leu Lys Gin Arg Leu Leu Glu Glu Gin Ala Lys 

1190 1195 1200 

Leu Gin Gin Gin Met Asp Leu Gin Lys Asn His He Phe Arg Leu 

1205 1210 1215 

Thr Gin Gly Leu Gin Glu Ala Leu Asp Arg Ala Asp Leu Leu Lys 

1220 1225 1230 

Thr Glu Arg Ser Asp Leu Glu Tyr Gin Leu Glu Asn He Gin Val 

1235 1240 1245 

Leu Tyr Ser His Glu Lys Val Lys Met Glu Gly Thr He Ser Gin 

1250 1255 1260 

Gin Thr Lys Leu lie Asp Phe Leu Gin Ala Lys Met Asp Gin Pro 

1265 1270 1275 

Ala Lys Lys Lys Lys Val Pro Leu Gin Tyr Asn Glu Leu Lys Leu 

1280 1285 1290 

Ala Leu Glu Lys Glu Lys Ala Arg Cys Ala Glu Leu Glu Glu Ala 

1295 1300 1305 

Leu Gin Lys Thr Arg He Glu Leu Arg Ser Ala Arg Glu Glu Ala 

1310 1315 1320 

Ala His Arg Lys Ala Thr Asp His Pro His Pro Ser Thr Pro Ala 

1325 1330 1335 

Thr Ala Arg Gin Gin He Ala Met Ser Ala He Val Arg Ser Pro 

1340 1345 1350 

Glu His Gin Pro Ser Ala Met Ser Leu Leu Ala Pro Pro Ser Ser 

1355 1360 1365 

Arg Arg Lys Glu Ser Ser Thr Pro Glu Glu Phe Ser Arg Arg Leu 

1370 1375 1380 

Lys Glu Arg Met His His Asn He Pro His Arg Phe Asn Val Gly 

1385 1390 1395 

Leu Asn Met Arg Ala Thr Lys Cys Ala Val Cys Leu Asp Thr Val 

1400 1405 1410 

His Phe Gly Arg Gin Ala Ser Lys Cys Leu Glu Cys Gin Val Met 

1415 1420 1425 

Cys His Pro Lys Cys Ser Thr Cys Leu Pro Ala Thr Cys Gly Leu 

1430 1435 1440 

Pro Ala Glu Tyr Ala Thr His Phe Thr Glu Ala Phe Cys Arg Asp 

1445 1450 1455 

Lys Met Asn Ser Pro Gly Leu Gin Thr Lys Glu Pro Ser Ser Ser 

1460 1465 1470 

Leu His Leu Glu Gly Trp Met Lys Val Pro Arg Asn Asn Lys Arg 

1475 1480 1485 

Gly Gin Gin Gly Trp Asp Arg Lys Tyr He Val Leu Glu Gly Ser 

1490 1495 1500 

Lys Val Leu He Tyr Asp Asn Glu Ala Arg Glu Ala Gly Gin Arg 

1505 1510 1515 

Pro Val Glu Glu Phe Glu Leu Cys Leu Pro Asp Gly Asp Val Ser 

1520 1525 1530 

He His Gly Ala Val Gly Ala Ser Glu Leu Ala Asn Thr Ala Lys 



1535 



1540 



1545 
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Ala Asp 
Thr Cys 
Pro Asp 
Gly Gly 
Leu Gly 
Met Aan 
Thr Glu 
Thr His 
Lys Asp 
Leu Cys 
Ser His 
Ala Val 
Gly Leu 
Arg Tyr 
Glu Thr 
lie Leu 
Tyr Thr 
Pro Ala 
Val Gin 
Phe His 
Arg Thr 
Tyr Arg 
Val He 
Ala Tyr 
Ser Ser 
Arg Val 
Glu His 
Arg Gly 



Val Pro Tyr He Leu Lys Met Glu Ser His Pro His Thr 

1550 1555 1560 

Trp Pro Gly Arg Thr Leu Tyr Leu Leu Ala Pro Ser Phe 

1565 1570 1575 

Lys Gin Arg Trp Val Thr Ala Leu Glu Ser Val Val Ala 

1580 1585 1590 

Arg Val Ser Arg Glu Lys Ala Glu Ala Asp Ala Lys Leu 

1595 1600 1605 

Asn Ser Leu Leu Lys Leu Glu Gly Asp Asp Arg Leu Asp 

1610 1615 1620 

Cys Thr Leu Pro Phe Ser Asp Gin Val Val Leu Val Gly 

1625 1630 1635 

Glu Gly Leu Tyr Ala Leu Asn Val Leu Lys Asn Ser Leu 

1640 1645 1650 

Val Pro Gly He Gly Ala Val Phe Gin He Tyr He He 

1655 1660 1665 

Leu Glu Lys Leu Leu Met He Ala Gly Glu Glu Arg Ala 

1670 1675 1680 

Leu Val Asp Val Lys Lys Val Lys Gin Ser Leu Ala Gin 

1685 1690 1695 

Leu Pro Ala Gin Pro Asp He Ser Pro Asn He Phe Glu 

1700 1705 1710 

Lys Gly Cys His Leu Phe Gly Ala Gly Lys He Glu Asn 

1715 1720 1725 

Cys He Cys Ala Ala Met Pro Ser Lys Val Val He Leu 

1730 1735 1740 

Asn Glu Asn Leu Ser Lys Tyr Cys He Arg Lys Glu He 

1745 1750 1755 

Ser Glu Pro Cys Ser Cys He His Phe Thr Asn Tyr Ser 

1760 1765 1770 

He Gly Thr Asn Lys Phe Tyr Glu He Asp Met Lys Gin 

1775 1780 1785 

Leu Glu Glu Phe Leu Asp Lys Asn Asp His Ser Leu Ala 

1790 1795 1800 

Val Phe Ala Ala Ser Ser Asn Ser Phe Pro Val Ser He 

1805 1810 1815 

Val Asn Ser Ala Gly Gin Arg Glu Glu Tyr Leu Leu Cys 

1820 1825 1830 

Glu Phe Gly Val Phe Val Asp Ser Tyr Gly Arg Arg Ser 

1835 1840 1845 

Asp Asp Leu Lys Trp Ser Arg Leu Pro Leu Ala Phe Ala 

1850 1855 1860 

Glu Pro Tyr Leu Phe Val Thr His Phe Asn Ser Leu Glu 

1865 1870 1875 

Glu He Gin Ala Arg Ser Ser Ala Gly Thr Pro Ala Arg 

1880 1885 1890 

Leu Asp He Pro Asn Pro Arg Tyr Leu Gly Pro Ala He 

1895 1900 1905 

Gly Ala He Tyr Leu Ala Ser Ser Tyr Gin Asp Lys Leu 

1910 1915 1920 

He Cys Cys Lys Gly Asn Leu Val Lys Glu Ser Gly Thr 

1925 1930 1935 

His Arg Gly Pro Ser Thr Ser Arg Ser Ser Pro Asn Lys 

1940 1945 1950 

Pro Pro Thr Tyr Asn Glu His He Thr Lys Arg Val Ala 



1955 



1960 



1965 
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Ser Ser Pro Ala Pro Pro Glu Gly Pro Ser His Pro Arg Glu Pro 

1970 1975 1980 

Ser Thr Pro His Arg Tyr Arg Glu Gly Arg Thr Glu Leu Arg Arg 

1985 1990 1995 

Asp Lys Ser Pro Gly Arg Pro Leu Glu Arg Glu Lys Ser Pro Gly 

2000 2005 2010 

Arg Met Leu Ser Thr Arg Arg Glu Arg Ser Pro Gly Arg Leu Phe 

2015 2020 2025 

Glu Asp Ser Ser Arg Gly Arg Leu Pro Ala Gly Ala Val Arg Thr 

2030 2035 2040 

Pro Leu Ser Gin Val Asn Lys Val Trp Asp Gin Ser Ser Val 

2045 2050 



<210> 22 ■ 

<211> 1665 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7638121CD1 



<400> 22 
Met Gly Cys 
1 

Ser Val Ser 
Gly Trp His 
Ser Leu Pro 
Glu Asp Val 
lie He Glu 
Ser Val Gin 
Gly Thr Thr 
Ala Gly Val 
Leu Cys Lys 
Asp Ser Glu 
Glu Val Lys 
Arg Val Gin 
He Pro Leu 
Asp He Leu 
Asp Gin Phe 
Cys Ser Ser 



Cys Arg Leu 
5 

Gin Gly Leu 
20 

Pro Gly Leu 
35 

Ala Leu Pro 
50 

Gin Ala Gin 
65 

Gly Asp Pro 

80 

Leu Val Asp 
95 

Tyr Ser Leu 

110 
Tyr Thr Cys 

125 
Ala Glu Leu 

140 
Lys Gin Ser 

155 
Glu Glu He 

170 
His Lys Gly 

185 
Arg Ser Arg 

200 
Ala Ala Leu 

215 
Glu Thr Arg 

230 
Glu Glu Leu 



Gly Cys Gly 
Thr Asn His 
Cys Gly Trp 
Gly Pro Pro 
Thr Gly Gly 
Gin Pro Ser 
Ser Thr Arg 
Val Leu Arg 
Leu Ala Gin 
Leu Val Leu 
His Arg Arg 
Gly Arg Gly 
Asn Lys He 
Thr Arg Ala 
Ser His Pro 
Lys Thr Leu 
Leu Asp Arg 



Gly Cys Ser 
10 

Pro Ser Met 
25 

Gly Gly Gly 

40 

Ser Met Gin 
55 

Thr Ala Gin 
70 

Val Thr Trp 
85 

Leu Ser Gin 
100 

His Met Ala 
115 

Asn Thr Gly 
130 

Gly Gly Asp 
145 

Lys Leu His 

160 

Val Phe Gly 
175 

Leu Cys Ala 
190 

Gin Ala Tyr 

205 

Leu Val Thr 
220 

He Leu He 
235 

Leu Tyr Arg 



Val Ala His 
15 

Val Gly Cys 
30 

Leu His Ser 
45 

Val Thr He 
60 

Phe Glu Ala 
75 

Tyr Lys Asp 
90 

Gin Gin Glu 
105 

Ser Lys Asp 
120 

Gly Gin Val 
135 

Asn Glu Pro 
150 

Ser Phe Tyr 

165 

Phe Val Lys 
180 

Ala Lys Phe 
195 

Arg Glu Arg 
210 

Gly Leu Leu 
225 

Leu Glu Leu 
240 

Lys Gly Val 
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Val Thr Glu 
Gly Leu His 
Lys Pro Ser 
Lys lie Cys 
Leu Gin Phe 
He He Gin 
Met Gly Val 
Ala Gly Glu 
Arg Val Ser 
Ala Lys Asp 
Arg Pro Ser 
Ser Met Pro 
Lys Phe Leu 
Tyr Lys Ser 
Gly Pro Pro 
Arg Asp Thr 
Glu Leu Ala 
Val Thr His 
Ser Ala Ser 
Glu Ala Pro 
Ala Ala Gin 
Phe Tyr His 
Pro Gly Ser 
Gly Gly Tyr 
Met Glu His 
Ala Thr Leu 
Leu Pro Ala 
Leu Glu His 



245 
Ala Glu Val 

260 
Tyr Leu His 

275 
Asn He Leu 

290 
Asp Phe Gly 

305 
Ser Gin Tyr 

320 
Gin Asn Pro 

335 
He Ser Tyr 

350 
Ser Asp Arg 

365 
Trp Ser Ser 

380 
Phe He Lys 

395 
Ala Ala Gin 

410 
Ala Glu Glu 

425 
Leu Ala Arg 

440 
He Leu Val 

455 
Asp Ser Pro 

470 
Gly Gly Ser 

485 
Pro Phe Ala 

500 
Ser Pro Leu 

515 
Leu Pro Glu 

530 
Ala Pro Pro 

545 
Gly Cys Val 

560 
Gin Ala Gly 

575 
Arg Arg His 

590 
He Ala Gly 

605 
Arg Val Leu 

620 
Leu Ala Lys 

635 
Ser Gly Thr 

650 
Asp Ser Pro 



Lys Val Tyr 
Ser His Gly 
Met Val His 
Phe Ala Gin 
Gly Ser Pro 
Val Ser Glu 
Leu Ser Leu 
Ala Thr Leu 
Pro Met Ala 
Ala Thr Leu 
Cys Leu Ser 
Ala His Phe 
Ser Arg Trp 
Met Arg Ser 
Ser Leu Gly 
Ser Ser Ser 
Arg Ala Lys 
Leu His Pro 
Olu Ala Glu 
Ala Ser Pro 
Pro Arg His 
Glu Ser Pro 
Pro Ala Arg 
Ala Leu Pro 
Glu Olu Glu 
Ala Pro Ser 
His Leu Ala 
Ser Thr Pro 



250 

He Gin Gin 
265 

Val Leu His 

280 

Pro Ala Arg 
295 

Asn He Thr 
310 

Glu Phe Val 

325 

Ala Ser Asp 
340 

Thr Cys Ser 
355 

Leu Asn Val 
370 

Ala His Leu 
385 

Gin Arg Ala 
400 

His Pro Trp 
415 

He Asn Thr 
430 

Gin Arg Ser 
445 

He Pro Glu 

460 

Val Ala Arg 
475 

Ser Ser Ser 
490 

Ser Leu Pro 

505 

Arg Gly Phe 
520 

Ala Ser Glu 
535 

Glu Gly Ala 
550 

Ser Val He 
565 

Glu His Gly 
580 

Arg Arg His 

595 

Gly Leu Arg 
610 

Ala Ala Arg 

625 

Phe Glu Thr 
640 

Pro Gly His 
655 

Arg Pro Ser 



255 

Leu Val Glu 
270 

Leu Asp He 

285 

Glu Asp He 
300 

Pro Ala Glu 
315 

Ser Pro Glu 

330 

He Trp Ala 
345 

Ser Pro Phe 
360 

Leu Glu Gly 
375 

Ser Glu Asp 
390 

Pro Gin Ala 
405 

Phe Leu Lys 
420 

Lys Gin Leu 
435 

Leu Met Ser 
450 

Leu Leu Arg 

465 

His Leu Cys 
480 

Ser Asp Asn 
495 

Pro Ser Pro 

510 

Leu Arg Pro 
525 

Arg Ser Thr 
540 

Gly Pro Pro 
555 

Arg Ser Leu 
570 

Ala Leu Ala 
585 

Leu Leu Lys 

600 

Glu Pro Leu 
615 

Glu Glu Gin 
630 

Ala Leu Arg 

645 

Ser His Ser 
660 

Ser Glu Ala 
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665 



670 



675 



Cys Gly Glu Ala Gin Arg Leu Pro Ser Ala Pro Ser Gly Gly Ala 

680 685 690 

Pro lie Arg Asp Met Gly His Pro Gin Gly Ser Lya Gin Leu Pro 

695 700 705 

Ser Thr Gly Gly His Pro Gly Thr Ala Gin Pro Glu Arg Pro Ser 

710 715 720 

Pro Asp Ser Pro Trp Gly Gin Pro Ala Pro Phe Cys His Pro Lys 

725 730 735 

Gin Gly Ser Ala Pro Gin Glu Gly Cys Ser Pro His Pro Ala Val 

740 745 750 

Ala Pro Cys Pro Pro Gly Ser Phe Pro Pro Gly Ser Cys Lys Glu 

755 760 765 

Ala Pro Leu Val Pro Ser Ser Pro Phe Leu Gly Gin Pro Gin Ala 

770 775 780 

Pro Leu Ala Pro Ala Lys Ala Ser Pro Pro Leu Asp Ser Lys Met 

785 790 795 

Gly Pro Gly Asp lie Ser Leu Pro Gly Arg Pro Lys Pro Gly Pro 

800 805 810 

Cys Ser Ser Pro Gly Ser Ala Ser Gin Ala Ser Ser Ser Gin Val 

815 820 825 

Ser Ser Leu Arg Val Gly Ser Ser Gin Val Gly Thr Glu Pro Gly 

830 835 840 

Pro Ser Leu Asp Ala Glu Gly Trp Thr Gin Glu Ala Glu Asp Leu 

845 850 855 

Ser Asp Ser Thr Pro Thr Leu Gin Arg Pro Gin Glu Gin Val Thr 

860 865 870 

Met Arg Lys Phe Ser Leu Gly Gly Arg Gly Gly Tyr Ala Gly Val 

875 880 885 

Ala Gly Tyr Gly Thr Phe Ala Phe Gly Gly Asp Ala Gly Gly Met 

890 895 900 

Leu Gly Gin Gly Pro Met Trp Ala Arg lie Ala Trp Ala Val Ser 

905 910 915 

Gin Ser Glu Glu Glu Glu Gin Glu Glu Ala Arg Ala Glu Ser Gin 

920 925 930 

Ser Glu Glu Gin Gin Glu Ala Arg Ala Glu Ser Pro Leu Pro Gin 

935 940 945 

Val Ser Ala Arg Pro Val Pro Glu Val Gly Arg Ala Pro Thr Arg 

950 955 960 

Ser Ser Pro Glu Pro Thr Pro Trp Glu Asp lie Gly Gin Val Ser 

965 970 975 

Leu Val Gin lie Arg Asp Leu Ser Gly Asp Ala Glu Ala Ala Asp 

980 985 990 

Thr lie Ser Leu Asp lie Ser Glu Val Asp Pro Ala Tyr Leu Asn 

995 1000 1005 

Leu Ser Asp Leu Tyr Asp lie Lys Tyr Leu Pro Phe Glu Phe Met 

1010 1015 1020 

lie Phe Arg Lys Val Pro Lys Ser Ala Gin Pro Glu Pro Pro Ser 

1025 1030 1035 

Pro Met Ala Glu Glu Glu Leu Ala Glu Phe Pro Glu Pro Thr Trp 

1040 1045 1050 

Pro Trp Pro Gly Glu Leu Gly Pro His Ala Gly Leu Glu lie Thr 

1055 1060 1065 

Glu Glu Ser Glu Asp Val Asp Ala Leu Leu Ala Glu Ala Ala Val 

1070 1075 1080 

Gly Arg Lys Arg Lys Trp Ser Ser Pro Ser Arg Ser Leu Phe His 
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1085 

Phe Pro Gly Arg His 

1100 

Leu Arg Olu Arg Val 

1115 

Leu Lys Oly Arg Pro 

1130 

Lys Lys Pro Gly Leu 

1145 

Trp Asp Arg Ala Pro 

1160 

Val Val Leu Oly Gin 

1175 

Gin Pro Ala Ala Gin 

1190 

Glu Ser Ser Ser Arg 

1205 

Gin Leu Leu Thr lie 

1220 

Tyr Thr Cys Ser Val 

1235 

Gly Val Leu Arg Lys 

1250 

Asp lie Gly Glu Val 

1265 

Pro Val Glu Ser Tyr 

1280 

Leu Glu Oly Gly Ser 

1295 

Cys Cys Tyr Leu Thr 

1310 

Phe Arg Thr Ala Cys 

1325 

Ser Pro Ser Glu Gin 

1340 

Ser Glu Glu Glu Ser 

1355 

Thr Lys Thr Phe Ala 

1370 

Ser Val Val Arg Gin 

1385 

Ala Ala Lys lie lie 

1400 

Leu Arg Glu Tyr Glu 

1415 

Ala Gin Leu His Ala 

1430 

lie Leu Glu Leu Cys 

1445 

Glu Arg Ala Ser Tyr 

1460 

Gin Met Leu Ser Ala 

1475 

His Leu Asp Leu Arg 

1490 

Leu Leu Lys Val Val 



1090 

Leu Pro Leu Asp Olu 

1105 

Lys Ala Ser Val Olu 

1120 

Glu Gly Leu Glu Lys 

1135 

Ala Ser Phe Arg Leu 

1150 

Thr Phe Leu Arg Olu 

1165 

Ser Val Thr Leu Ala 

1180 

Ala Thr Trp Ser Lys 

1195 

Val Leu lie Ser Ala 

1210 

Leu Val Val Val Ala 

1225 

Ser Asn Ala Leu Gly 

1240 

Ala Glu Arg Pro Ser 

1255 

Tyr Ala Asp Gly Val 

1270 

Gly Pro Val Thr Tyr 

1285 

Trp Thr Thr Leu Ala 

1300 

Ser Lys Leu Ser Arg 

1315 

Val Ser Lys Ala Gly 

1330 

Val Leu Leu Gly Oly 

1345 

Gin Gly Arg Ser Ala 

1360 

Phe Gin Thr Gin He 

1375 

Cys Trp Olu Lys Ala 

1390 

Pro Tyr His Pro Lys 

1405 

Ala Leu Lys Gly Leu 

1420 

Ala Tyr Leu Ser Pro 

1435 

Ser Gly Pro Glu Leu 

1450 

Ser Glu Ser Glu Val 

1465 

Thr Gin Tyr Leu His 

1480 

Ser Glu Asn Met He 

1495 

Asp Leu Oly Asn Ala 



1095 

Pro Ala Glu Leu Oly 

1110 

His He Ser Arg He 

1125 

Olu Oly Pro Pro Arg 

1140 

Ser Gly Leu Lys Ser 

1155 

Leu Ser Asp Glu Thr 

1170 

Cys Gin Val Ser Ala 

1185 

Asp Gly Ala Pro Leu 

1200 

Thr Leu Lys Asn Phe 

1215 

Olu Asp Leu Gly Val 

1230 

Thr Val Thr Thr Thr 

1245 

Ser Ser Pro Cys Pro 

1260 

Leu Leu Val Tzp Lys 

1275 

He Val Gin Cys Ser 

1290 

Ser Asp He Phe Asp 

1305 

Gly Oly Thr Tyr Thr 

1320 

Met Gly Pro Tyr Ser 

1335 

Pro Ser His Leu Ala 

1350 

Gin Pro Leu Pro Ser 

1365 

Gin Arg Oly Arg Phe 

1380 

Ser Oly Arg Ala Leu 

1395 

Asp Lys Thr Ala Val 

1410 

Arg His Pro His Leu 

1425 

Arg His Leu Val Leu 

1440 

Leu Pro Cys Leu Ala 

1455 

Lys Asp Tyr Leu Trp 

1470 

Asn Gin His He Leu 

1485 

He Thr Olu Tyr Asn 

1500 

Oln Ser Leu Ser Gin 
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Glu Lys 



Met Ala 



Glu Tyr 



Asp lie 



Ser Gly 



Leu Arg 



Leu Thr 



Arg Arg 



Pro Trp 



Phe Pro 



1505 1510 
Val Leu Pro Ser Asp Lys Phe Lys Asp Tyr Leu 

1520 1525 
Pro Glu Leu Leu Glu Gly Gin Gly Ala Val Pro 

1535 1540 
Trp Ala lie Gly Val Thr Ala Phe He Met Leu 

1550 1555 
Pro Val Ser Ser Glu Gly Ala Arg Asp Leu Gin 

1565 1570 
Lys Gly Leu Val Arg Leu Ser Arg Cys Tyr Ala 

1580 1585 
Gly Ala Val Ala Phe Leu Arg Ser Thr Leu Cys 

1595 1600 
Gly Arg Pro Cys Ala Ser Ser Cys Leu Gin Cys 

1610 1615 
Glu Glu Gly Pro Ala Cys Ser Arg Pro Ala Pro 

1625 1630 
Thr Ala Arg Leu Arg Val Phe Val Arg Asn Arg 

1640 1645 
Ala Leu Leu Tyr Lys Arg His Asn Leu Ala Gin 

1655 1660 



1515 
Glu Thr 

1530 
Gin Thr 

1545 
Ser Ala 

1560 
Arg Gly 

1575 
Gly Leu 

1590 
Ala Gin 

1605 
Pro Trp 

1620 
Val Thr 

1635 
Glu Lys 

1650 
Val Arg 

1665 



<210> 23 

<211> 1014 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7482896CB1 

<400> 23 

atgacaaaca acagcggctc caaagccgaa ctcgttgtgg gagggaaata caaactggtg 60 
cggaagatcg ggtctggctc ctttggagac gtttatctgg gcatcaccac caccaacggc 120 
gaggacgtag cagtgaagct ggaatctcag aaggtcaagc acccccagtt gctgtatgag 180 
agcaaactct acacgattct tcaaggtggg gttggcatcc cccacatgca ctggtatggt 240 
caggaaaaag acaacaatgt gctagtcatg gaccttctgg gacccagcct cgaagacctc 300 
tttaatttct gttcaagaag gttcaccatg aaaactgtac ttatgttagc cgaccagatg 360 
atcagcagaa ttgaatacgt gcatacaaag aattttctac accgagacat taaaccagat 420 
aacttcctga tgggtactgg gcgtcactgt aataagttgt tccttattga ttttggtttg 480 
gccaaaaagt acagagacaa caggaccagg caacacatac cgtacagaga agataaacac 540 
ctcattggca ctgtccgata tgccagcatc aatgcacatc ttggtattga gcagagccgc 600 
cgagatgaca tggaatcctt aggctacgtt ttcatgtatt ttaatagaac cagcctgccg 660 
tggcaaggac taagggctat gacaaaaaaa caaaaatatg aaaagattag tgagaagaag 720 
atgtccaccc ctgttgaagt tttatgtaag gggtttcctg cagaattcgc catgtacttg 780 
aactactgtc gtgggctgcg ctttgaggaa gtcccagatt acatgtatct gaggcagcta 840 
ttccgcattc ttttcaggac cctgaaccac caatatgact acacatttga ttggacgatg 900 
ttaaagcaga aagcagcaca gcaggcagcc tcttccagtg ggcagggtca gcaggcccaa 960 
acccagacag gcaagcaaac tgaaaaaaac aagaataatg tgaaagataa ctaa 1014 

<210> 24 

<211> 1530 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: 7483046CB1 



<400> 24 

cggcctgaca ggcgggcatg cgggcggcca 
cagccatgga gcggcggctg cgcgcgctgg 
gcccggggct cgacggcctc ctagatctgc 
gccccctacg gcgggagcgc agcgtggcgc 
caaaggtgaa agaactgcgt ctgcagagag 
gaggagcctt tggggaggtc accgtggtga 
tgaaaatgct gcacaagtgg gagatgctga 
agcgggatgt gctcgtgaaa ggggacagcc 
aagacgagga gtacctgtac cttgtgatgg 
tgctgagccg cttcgaggac cgtctcccgc 
tggtgctggc catccactcg ctgcaccagc 
acaacgtcct gctggatgtg aacgggcaca 
gtctcaacac caacggcatg gtggattcat 
cccctgagat cctgcaggcc atggaggagg 
.ggtggtcgct tggagtctgc gcctatgagc 
agtccttggt ggaaacctac ggcaagatca 
cggacgtgcc tgacgtgcca gccagcgccc 
aggaagagcg gctaggccgt ggtgggctgg 
gcgtggactg ggagcggctg gcgagcagca 
ccatggacac ctccaactt^ gatgtggatg 
caccgccctc ccacggggcc ttctccggcc 
cctcaggcag tcacagtcct gagagcagct 
tccagtgtct ggagcaggag aaggtggagc 
cccccacaga ccatcgggag ctggagcagc 
ggctgccagg tatcccttcc gcccaccccc 
tatttgcaaa tccaaaaaaa aaaaaaaagg 



gactgtagcc gagcagcgag gctccggccg 60 
agcagctggc gcggggcgag gccggcggct 120 
tgctggcgct gcaccacgag ctcagcagcg 180 
agttcctgag ctgggccagc cccttcgtat 240 
atgactttga gatcttgaag gtgatcggcc 300 
ggcagaggga cactgggcag atttttgcca 360 
agagggctga gacagcctgt ttccgggagg 420 
gttgggtgac cactctgcac tatgccttcc 480 
actactatgc tggtggggac ctcctgacgc 540 
ccgagctggc ccagttctac ctggctgaga 600 
tgggttatgt ccacagggat gtcaagccag 660 
ttcgcctggc tgacttcggc tcctgcctgc 720 
cagtggcagt agggacgccg gactatatct 780 
gcaagggcca ctacggccca cagtgtgact 840 
tgctctttgg ggagacgccc ttctatgctg 900 
tgaaccacga ggaccacctg cagttccccc 960 
aagacctgat ccgccagctg ctgtgtcgcc 1020 
atgacttccg gaaccatcct ttcttcgaag 1080 
cggcccccta tattcctgag ctgcgggggc 1140 
acgacaccct caaccatcca gggaccctgc 1200 
atcacctgcc attcgtgggc ttcacctaca 1260 
ctgaggcttg ggctgccctg gagcggaagc 1320 
tgagcaggaa gcaccaagag gccctgcacg 1380 
tacggaagga agtgcagact ctgcgggaca 1440 
accctctcct tgagtttctg tgaattaaaa 1500 

1530 



<210> 25 

<211> 3150 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^f eature 

<223> Incyte ID No: 71636374CB1 

<400> 25 

attggcttat aggaaaaatt gatttataaa 
acaacatccc atatgaatgg gcatgttaca 
gatcttgcat caccagagga acatcagaag 
gatttgggca ccaggatgat gccaatacgt 
caacaggagg acatgaggcg taggagagag 
aattcttcca tgagacttaa gaaactagcc 
cctatgtttg atacagagga aggaattgtc 
ttagaaatag aagacttgtt ttcttcactt 
cagagccagg aggatatttc actgctttta 
gcatttaaga tacacaatgc catcacagta 
cttatctcca acgcacaaga tcttgctcaa 
cataaggaag gacaagaact aactgctttg 
ctggcccacg ataaggttgc tgagcaggaa 
gtttatgaaa gtattggcca gtatggagga 



aagtggtaca ggttttcata gataaccatg 60 
gaggaatcag acagcgaagt aaaaaatgtt 120 
caccgagaga tggctgttga ctgccctgga 180 
cgaagtgcac agttggagcg tattcggcaa 240 
gaagaaggga aaaagcaaga acttgacctt 300 
caaattcctc caaagaccgg aatagataac 360 
ttagaaagtc ctcattatgc tgtgaaaata 420 
aaacatatcc aacatacttt ggtagattct 480 
caacttgttc aaaataagga tttccagaat 540 
cacatgaaca aggccagtcc tccatttcct 600 
gaggtacaaa ctgttttgaa gccagttcat 660 
ctgaatactc cacatattca ggcactttta 720 
atgcagctag agcccattac agatgagaga 780 
gaaactgtaa aaatagttcg tatagaaaag 840 



52/75 



wo 02/33099 





PCT/USOl/47728 



gctcgtgata ttccgttggg tgctacagtt cgtaatgaaa tggactctgt catcattagc 900 
cggatagtaa aagggggtgc tgcagagaaa agtggtctgt tgcatgaagg agatgaagtt 960 
ctagagatta atggcattga aattcggggg aaagatgtca atgaggtttt tgacttgttg 1020 
tctgatatgc atggtacttt gacttttgtc ctgattccca gtcaacagat caagccgcct 1080 
cctgccaagg aaacagtaat ccatgtaaaa gctcattttg actatgaccc ctcagatgac 1140 
ccttatgttc catgtcgaga gttaggtctg tcttttcaaa aaggtgatat acttcatgtg 1200 
atcagtcaag aagatccaaa ctggtggcag gcctacaggg aaggggacga agataatcaa 1260 
cctctagccg ggcttgttcc agggaaaagc tttcagcagc aaagggaagc catgaaacaa 1320 
accatagaag aagataagga gccagaaaaa tcaggaaaac tgtggtgtgc aaagaagaat 1380 
aaaaagaaga ggaaaaaggt tttatataat gccaataaaa atgatgatta tgacaacgag 1440 
gagatcttaa cctatgagga aatgtcactt tatcatcagc cagcaaatag gaagagacct 1500 
atcatcttga ttggtccaca gaactgtggc cagaatgaat tgcgtcagag gctcatgaac 1560 
aaagaaaagg accgctttgc atctgcagtt cctcatacaa cccggagtag gcgagaccaa 1620 
gaagtagccg gtagagatta ccactttgtt tcgcggcaag cattcgaggc agacatagca 1680 
gctggaaagt tcattgagca tggtgaattt gagaagaatt tgtatggaac tagcatagat 1740 
tctgtacggc aagtgatcaa ctctggcaaa atatgtcttt taagtcttcg tacacagtca 1800 
ttgaagactc tccggaattc agatttgaaa ccatatatta tcttcattgc acccccttca 1860 
caagaaagac ttcgggcatt attggccaaa gaaggcaaga atccaaagcc tgaagagttg 1920 
agagaaatca ttgagaagac aagagagatg gagcagaaca atggccacta ctttgatacg 1980 
gcaattgtga attccgatct tgataaagcc tatcaggaat tgcttaggtt aattaacaaa 2040 
cttgatactg aacctcagtg ggtaccatcc acttggctga ggtgaaagaa acatccattc 2100 
tgtggcatgt tggacttgat ctggcaaaaa ctgccaatag gaggactgcc cgacactgca 2160 
gcaagattga ggataagatg gaaggcagca gtataagctg tagatctgtt cttagatctc 2220 
ttgaattagt gagacgacag ttcccttagg cagtttgtgc atggcatcct ttattctcta 2280 
tacatggctt tagcggttct tgcctcattt tgggattcta aatggaagct ttcaacagag 2340 
cattccattt tgtcctgtta aaaccttttg ttttcaccta aaccctttct gcttagttgt 2400 
atctctgtga aaaacttgta tacacaagcg tccatgtctc acacaaatat tgatgtgatt 2460 
attcttaagt gttaaatcat taacacttaa atgacttcat tgggaatatt gagcagaggg 2520 
actgtgcttc tatgcactgg gcaaggcagt atttgcttag gaaactaatt tagtcatcag 2580 
agatactttc ctaaaaagga aaaataaaaa acaaaatggt gccactttgg gttgaagcta 2640 
ctttgttagg cttgaattca tttatatgtc ttttgattct taaaaaaaca aaaaacattc 2700 
cattagaagc accagttttt ttgctcagac tttgtggatc agactctaca ctcaacacac 2760 
tctaatctac ttaaaggtat acaaaatatg ctgatctttt ttaaattatg atttcctgaa 2820 
tttttttctt aagtcgtctc aactgattta ctcacttagc ttcccttccc tcatcagcat 2880 
agtataatag aatgtatgtt acatttttat gaatggcagg tgttcattat aatctgtatt 2940 
gacttaaaaa gtttcttcct catgatgcta atagtttttt gtatacatgg gaggatagca 3000 
catttgacag tttttgcatt tttatgtatg agcacagtat cctatgactg tgctacgtat 3060 
atataggtaa taaactggaa ttctgttgat gaatatagct gctgtactgt atattaatat 3120 
ttaatagatc aacaaatggt cattgaaaac 3150 

<210> 26 

<211> 2901 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 7480597CB1 



atggcggaag gcaaggaagg gcaagtccca tcttacatgg atggcagcag gcaaagagag 60 
aatgaggaag atgcaaaagc ggaaacccct gatgtaacca tcagatctta tgagatttat 120 
tcactaccat ggaacagaca gcaaggccta tgtgaccatt ctctaaaata tttaagctcg 180 
agaatcacag agcggaagct gcaaggctcc tggctgcctg ccagccgagg gaatctggag 240 
aaaccattcc tggggccgcg tggccccgtc gtgcccttgt tctgccctcg gaatggcctt 300 
cactcagcac atcctgagaa cagccctctg aagcccaggg tcgtgaccgt agtgaagctg 360 



<400> 26 



53/75 



wo 02/33099 PCTAJSOl/47728 





99tgggcagc gcccccgaaa gatcactctg ctcctcaaca ggcgatcagt gcagacgttc 420 
gagcagctct tagctgacat ctcagaagcc ttgggctctc ccagatggaa gaatgaccgt 480 
gtgaggaaac tgtttaacct caagggcagg gaaatcagga gcgtctctga tttcttcagg 540 
gaaggggatg ctttcatagc tatgggcaaa gaaccactga cactgaagag cattcaggtg 600 
gctgtagaag aactgtaccc caacaaagcc cgggccctga cactggccca gcacagccgt 660 
gccccttctc caaggctgag gagcaggctg tttagcaagg ctctgaaagg agaccaccgc 720 
tgtggggaga ccgagacccc caagagctgc agcgaagttg caggatgcaa ggcagccatg 780 
aggcaccagg ggaagatccc cgaggagctt tcactagatg acagagcgag gacccagaag 840 
aagtggggga gggggaaatg ggagccagaa cccagtagca agccccccag ggaagccact 900 
ctggaagaga ggcacgcaag gggagagaag catcttgggg tggagattga aaagacctcg 960 
ggtgaaatta tcagatgcga gaagtgcaag agagagaggg agctccagca gagcctggag 1020 
cgtgagaggc tttctctggg gaccagtgag ctggatatgg ggaagggccc aatgtatgat 1080 
gtggagaagc tggtgaggac cagaagctgc aggaggtctc ccgaggcaaa tcctgcaagt 1140 
ggggaggaag ggtggaaggg tgacagccac aggagcagcc ccaggaatcc cactcaagag 1200 
ctgaggagac ccagcaagag catggacaag aaagaggaca gaggcccaga ggatcaagaa 1260 
agccatgctc agggagcagc caaggccaag aaggaccttg tggaagttct tcctgtcaca 1320 
gaggaggggc tgagggaggt gaagaaggac accaggccca tgagcaggag caaacatggt 1380 
ggctggctcc tgagagagca ccaggcgggc tttgagaagc tccgcaggac ccgaggagaa 1440 
gagaaggagg cagagaagga gaaaaagcca tgtatgtctg gaggcagaag gatgactctc 1500 
agagatgacc aacctgcaaa gctagaaaag gagcccaaga cgaggccaga agagaacaag 1560 
ccagagcggc ccagcggtcg gaagccacgg cccatgggca tcattgccgc caatgtggaa 1620 
aagcattatg agactggccg ggtcattggg gatgggaact ttgctgtcgt gaaggagtgc 1680 
agacaccgcg agaccaggca ggcctatgcg atgaagatca ttgacaagtc cagactcaag 1740 
ggcaaggagg acatggtgga cagtgagatc ttgatcatcc agagcctctc tcaccccaac 1800 
atcgtgaaat tgcatgaagt ctacgaaaca gacatggaaa tctacctgat cctggagtac 1860 
gtgcagggag gagacctttt tgacgccatc atagaaagtg tgaagttccc ggagcccgat 1920 
gctgccctca tgatcatgga cttatgcaaa gccctcgtcc acatgcacga caagagcatt 1980 
gtccaccggg acctcaagcc ggaaaacctt ttggttcagc gaaatgagga caaatctact 2040 
accttgaaat tggctgattt tggacttgca aagcatgtgg tgagacctat atttactgtg 2100 
tgtgggaccc caacttacgt agctcccgaa attctttctg agaaaggtta tggactggag 2160 
gtggacatgt gggctgctgg cgtgatcctc tatatcctgc tgtgtggctt tcccccattc 2220 
cgcagccctg agagggacca ggacgagctc tttaacatca tccagctggg ccactttgag 2280 
ttcctccccc cttactggga caatatctct gatgctgcta aagatctggt gagccggttg 2340 
ctggtggtag accccaaaaa gcgctacaca gctcatcagg ttcttcagca cccctggatc 2400 
gaaacagctg gcaagaccaa tacagtgaaa cgacagaagc aggtgtcccc cagcagcgag 2460 
ggtcacttcc ggagccagca caagagggtt: gtggagcagg tatcatagtc accaccttgg 2520 
gaatctgtcc agcccccagt tctgctcaag gacagagaaa aggatagaag tttgagagaa 2580 
aaacaatgaa agaggcttct tcacataatt ggtgaatcag agggagagac actgagtata 2640 
ttttaaagca tattaaaaaa attaagtcaa tgttaaatgt cacaacatat ttttagattt 2700 
gtatatttaa agcctttaat acatttttgg ggggtaagca ttgtcatcag tgaggaattt 2760 
tggtaataat gatgtgtttt gcttcccctt tgtaaccaag tttattctgt actacaggag 2820 
tggtgcttac cagggtctaa actccccctg tgagattaat aaggtgcatt gtggtctttc 2880 
tgtgttaata aaatgtggtc c 2901 

<210> 27 

<211> 1671 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^f eature 

<223> Incyte ID No: 3227248CB1 



<400> 27 

atgaagctta taaatggcaa aaagcaaaca ttcccatggt' ttggcatgga catcggtgga 60 
acgctggtta aattggtgta tttcgagccg aaggatatta cagccgaaga ggagcaagag 120 



54/75 



wo 02/33099 



PCT/USOl/47728 



gaagtggaga acctgaagag catccggaag 
actgggatcc gagacgtcca cctggaactg 
aacctgcact tcatccgctt tcccagctgt 
gagaagaact tctctagcct tcacaccacc 
ttcgaagagg acttcagaat gattgctgac 
tgtctgattc agggcctgct ttatgtcgac 
tactattttg aaaatcccac aaatcctgaa 
aacccatacc ctatgttgct ggttaacatg 
tccaaggaca actataaaag agttacaggg 
ctatgttgct tgctgactgg ttgtgagacc 
ggcgacagca ccaatgttga taaactggtg 
tttggccttc aaggatctgc tgtagcatca 
cgagattcca tcagcaagga agacctcgcc 
attggctcca ttgctcggat gtgtgcgttg 
ggaaattttc tcagaatcaa tatggtctcc 
tggtccaaag gacaactgaa agctctgttt 
ggggcactgt tggaactgtt caaaatgact 
acagcctccc aaaaggacag agaactaaaa 
tgggacggaa gccaagccat tatggcagat 
tccttccaga tgatctttta ctcttaggtt 
aaaaggtttt ttttctgtat actgtatttt 
taccaattgt atgcattaac tttgaaaagt 
aagcgctggt catttttctt ctggggttta 
agtaattact ctaggagatt ttaccttgac 
tgttggtttc aaatgaaact acaaatctgg 
gttttgtacc ctttttgtct tgtttttctg 



tatttgactt ctaatactgc ttatgggaaa 180 
aaaaacctga ccatgtgtgg acgcaaaggg 240 
gctatgcaca ggttcattca gatgggcagc 300 
ctctgtgcca caggaggcgg ggctttcaaa 360 
ctgcagctgc ataaactgga tgaactggac 420 
tctgttggct tcaacggcaa gccagaatgt 480 
ttgtgtcaaa aaaagccgta ctgccttgat 540 
ggctcaggtg tcagcattct agccgtgtac 600 
accagtcttg gaggtggaac attcctaggc 660 
tttgaagaag ctctggaaat ggcagctaaa 720 
aaggacattt acggaggaga ctatgaacga 780 
agctttggca acatgatgag taaagaaaag 840 
cgggccacat tggtcaccat caccaacaac 900 
aatgagaaca tagacagagt tgtgtttgtt 960 
atgaagctgc tggcatatgc catggatttt 1020 
ttggaacatg agggttattt tggagccgtt 1080 
gatgataagt agagacgagc agtggaggaa 1140 
aattgctgct ggagaaggtg aaagtcgctt 1200 
gaacctgctg gatttgtaaa taatttaaaa 1260 
ttgagctaat gattcaaaac gggggaatat 1320 
tttaaaaaaa tggtgcagcg tggccaaacc 1380 
tgtttgatgt ttaagaagga cctgatatgt 1440 
ctgatcagtg tggtgatttt aacttcattt 1500 
ttatattttt catgacgttt catgatttgc 1560 
catgttttac tgtgaacact tttgttattt 1620 
ttttagttgt cttctgaaaa a 1671 



<210> 28 

<211> 2577 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> inisc_f eature 

<223> Incyte ID No: 4207273CB1 



<400> 28 

atgccacaga tagcaaagaa gcaatcaact 
tttccttgca tctgtaaaaa tccaggaaca 
ccgacagagc caagactaaa ttacctagat 
aattcaactg ctaatggacc tggaatctat 
gtgcgagaga ctgaaaggga tgaaaacacg 
ggcagacgta tcaccaataa atgtcgatct 
acaagacttt ctcagaaaaa aacacatatg 
caagagcaca aagtcttaat ttctaaagaa 
catgacattg aaaatggtga tggtatttca 
aatgagtttc tatcttccaa agatgaaatt 
cagtccatga aacagaatga attccctcct 
tctatggaag agtctactgg tgatagagac 
ctcagagatc tgcaagaact tgaagagcta 
gacagctggg cagtgcccag tgagaagaat 
aatacagcat ctcttagtaa agtaaatgcc 
gatagtgttt cagatcactc taaaacactt 
agtgcatctt cccagacata tcaatattgg 
aataagtcaa tcacatatca aatgtttgga 
caagaaatta tggactctgt aaataatgaa 



caccggactc agaaacctaa aaagcaatca 60 
cagaagtcat gtgttcctct ctctgttcaa 120 
cttaagtata gtgatatgtt caaagaaatc 180 
gaaatgtttg ggacccctgt ttattgtcat 240 
tattaccgtg agatatgttc ggctccatca 300 
tcacacagtg agaggaagag caatatcaga 360 
aaatgcccaa agacttcatt tggcattaaa 420 
aagagttcca aggctgtaca tagcaaccta 480 
gaaccagact ggcagataaa gtcttcagga 540 
catcccatga acttggctca gacacctgag 600 
gtctcagatt tatccattgt tgaagaagtt 660 
atttctaaca atcaaatact caccacaagc 720 
catcaccaga tcccatttat cccttcagaa 780 
tctaacaagt atgtacagca agaaaagcag 840 
agccgaattt taactaatga tctagagttt 900 
acaaatttct ctttccaagc aaaacaagaa 960 
gtacattatt tggatcatga tagtttagca 1020 
aaaaccttaa gtggcacaaa ttcaatttcc 1080 
gaattgacag atgaactatt aggttgtcta 1140 



55/75 
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gctgcagaat tattagctct tgatgagaaa gataacaact cttgccaaaa aatggcaaat 1200 
gaaacagatc ctgaaaacct aaatcttgtc ctcagatgga gaggaagtac cccaaaagaa 1260 
atgggcagag agacaacaaa agtcaaaata cagaggcata gtagtgggct caggatatat 1320 
gacagggagg agaaatttct catctcaaat gaaaagaaga tattttctga aaatagttta 1380 
aagtctgaag aacctatcct atggaccaag ggtgagattc ttggaaaggg agcctacggc 1440 
acagtatact gtggtctcac tagtcaagga cagctaatag ctgtaaaaca ggtggctttg 1500 
gatacctcta ataaattagc tgctgaaaag gaataccgga aactacagga agaagtagat 1560 
ttgctcaaag cactgaaaca tgtcaacatt gtggcctatt tggggacatg cttgcaagag 1620 
aacactgtga gcattttcat ggagtttgtt cctggtggct caatctctag tattataaac 1680 
cgttttgggc cattgcctga gatggtgttc tgtaaatata cgaaacaaat acttcaaggt 1740 
gttgcttatc tccatgagaa ctgtgtggta catcgcgata tcaaaggaaa taatgttatg 1800 
ctcatgccaa ctggaataat aaagctgatt gactttggct gtgccaggcg tttggcctgg 1860 
gcaggtttaa atggcaccca cagtgacatg cttaagtcca tgcatgggac tccatattgg 1920 
atggccccag aagtcatcaa tgagtctggc tatggacgga aatcagatat ctggagcatt 1980 
ggttgtactg tgtttgagat ggctacaggg aagcctccac tggcttccat ggacaggatg 2040 
gccgccatgt tttacatcgg agcacaccga gggctgatgc ctcctttacc agaccacttc 2100 
tcagaaaatg cagcagactt tgtgcgcatg tgcctgacca gggaccagca tgagcgacct 2160 
tctgctctcc agctcctgaa gcactccttc ttggagagaa gtcactgaat atacatcaag 2220 
actttcttcc cagttccact gcagatgctc ccttgcttaa ttgtggggaa tgatggctaa 2280 
gggatctttg tttccccact gaaaattcag tctaacccag tttaagcaga tcctatggag 2340 
tcattaactg aaagttgcag ttacatatta gcctcctcaa gtgtcagaca ttattactca 2400 
tagtatcaga aaacatgttc ttaataacaa caaaaaacta tttcagtgtt tacagttttg 2460 
attgtccagg aactacattc tctagtgttt tatatgacat ttctttttat ttttggcctg 2520 
tcctgtcaat tttaatgttg ttagtttaaa ataaattgta aaaacaaaaa aaaaaaa 2577 

« ■ 

<210> 29 
<211> 2110 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7483334CB1 

<400> 29 

ctagggtcgc cggggaagcg gtttgggaga 
ctgtgtggat gccccagcat ggatgactac 
ttcggcagag ctcttttggt tcagcttgaa 
ataaggcttc ccaagtcttt ctctaataca 
gccaaaatga aacaccctaa tattgttgcc 
ttgtatattg tgatggaata ctgtgatgga 
aaaggaaagt tatttcctga agacatgata 
gtaaatcaca ttcacaagaa acgtgtgcta 
ctcactcaga atggaaaagt gaaattggga 
ccgatggcat ttgcttgtac ctatgtggga 
gaaaacctgc cttataacaa taaaagtgac 
ctctgtaccc ttaagcatcc atttcaggca 
tgtcaagggt gcatcagtcc actgccgtct 
aagcagatgt ttaaaaggaa tccctcacat 
ggcatcgtag ctcggcttgt ccagaagtgc 
gaggaagtat tagaagaaat aaaaaattcg 
cccagcagaa tcaggatagc tttgggaaat 
gatagaaagg gtagccatac tgatttggaa 
ttgagaagag taaacagaga agaaaaaggt 
tcaccaaatc ttcatagacg acagtgggag 
ttggaaaatg catccatact cacctccagt 



gcccatggtg actgcgtgag tggagcccag 60 
atggtcctga gaatgattgg ggagggctcc 120 
agcagtaatc agatgtttgc catgaaagaa 180 
cagaattcta ggaaggaggc tgttctttta 240 
ttcaaagaat catttgaagc tgaaggacac 300 
ggggatctaa tgcaaaagat taaacagcag 360 
cttaattggt ttacccaaat gtgccttgga 420 
cacagagata tcaagtccaa gaatatcttc 480 
gactttggat ctgcccgtct tctctccaat 540 
actccttatt atgtgcctcc agaaatttgg 600 
atctggtcct tgggttgcat cctgtatgaa 660 
aatagttgga aaaatcttat cctcaaagta 720 
cattactcct atgaacttca gttcctagtc 780 
cgcccctcgg ctacaacgct tctctctcga 840 
ttaccccccg agatcatcat ggaatatggt 900 
aagcataaca caccaagaaa aaaaacaaac 960 
gaagcaagca cagtgcaaga ggaagaacaa 1020 
agcattaatg aaaatttagt tgaaagtgca 1080 
aataagtcag tccatctgag gaaagccagt 1140 
aaaaatgtac ccaatacagc tcttacagct 1200 
ttaacagcag aggacgatag aggtggttct 1260 



56/75 



wo 02/33099 
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gtaataaagt acagcaaaaa tactactcgt aagcagtggc tcaaagagac ccctgacact 1320 
ttgttgaaca tccttaagaa tgctgatctc agcttggctt ttcaaacata cacaatatat 1380 
agaccaggtt cagaagggtt cttgaaaggc cccctgtctg aagaaacaga agcatcggac 1440 
agtgttgatg gaggtcacga ttctgtcatt ttggatccag agcgacttga gcctgggcta 1500 
gatgaggagg acacggactt tgaggaggaa gatgacaacc ccgactgggt gtcagagctg 1560 
aagaagcgag ctggatggca aggcctgtgc gacagataat gcctgaggaa atgttcctga 1620 
gtcacgctga ggagagcctt cactcaggag ttcatgctga gatgatcatg agttcatgcg 1680 
acgtatattt tcctttggaa acagaatgaa gcagaggaaa ctcttaatac ttaaaatcgt 1740 
tcttgattag tatcgtgagt ttgaaaagtc tagaactcct gtaagttttt gaactcaagg 1800 
gagaaggtat agtggaatga gtgtgagcat cgggctttgc agtcccatag aacagaaatg 1860 
ggatgctagc gtgccactac ctacttgtgt gattgtggga aattacttaa cctcttcaag 1920 
ccccaatttc ctcaaccata aaatgaagat aataatgcct acctcagagg gatgctgacc 1980 
acagaccttt atagcagccc gtatgatatt attcacatta tgatatgtgt t tat tat tat 2040 
gtgactcttt ttacatttcc taaaggtttg agaattaaat atatttaatt atgatttaaa 2100 
aaaaaaaaaa 2110 



<210> 30 
<211> 7093 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> xnisc^f eature 

<223> Incyte ID No: 7483337CB1 



<400> 30 

cgaggggacg cctcgcgacg gttcctggga gagctggcgg cggccttgct ctgcgcgctc 60 
ttcgcgccgc cctccccgcc cgcccgcctc aggattgagg aagtgcgtct gggcccggcc 120 
ccggcgcggg gggcagacgg cggtgggacg gccaggcccc ggccccgcca gtgtgtccgc 180 
ccggccccgc gtcccggagg agtcagctgt gtgtccagaa cgtgccatgg agacgcttaa 240 
cggtgccggg gacacgggcg gcaagccgtc cacgcggggc ggtgaccctg cagcgcggtc 300 
ccgcaggacg gaaggcatcc gcgccgcgta caggcgggga gaccgcggcg gcgcccggga 360 
cctgctggag gaggcctgcg accagtgcgc gtcccagctg gaaaagggcc agcttctgag 420 
catcccggca gcctatgggg atctggagat ggtccgctac ctactcagca agagactggt 480 
ggagctgccc accgagccca cggatgacaa cccagccgtg gtggcagcgt attttggaca 540 
cacggcagtt gtgcaaaata cgctgcccac cgagcccacg gatgacaacc cagccgtggt 600 
ggcagcgtat tttggacaca cggcagttgt gcaggaattg cttgagtcct taccaggtcc 660 
ctgcagtccc cagcggcttc tgaactggat gctggccttg gcttgccagc gagggcacct 720 
gggggttgtg aagctcctgg tcctgacgca cggggctgac ccggagagct acgctgtcag 780 
gaagaatgag ttccctgtca tcgtgcgctt gcccctgtat gcggccatca agtcagggaa 840 
tgaagacatt gcaa tattoo tgcttcggca tggggcctat ttctgttcct acatcttgct 900 
ggatagtcct gaccccagca aacatctgct gagaaagtac ttcattgaag ccagtccctt 960 
gcccagcagt tatccgggaa aaacagctct ccgtgtgaaa tggtcccatc tcagactgcc 1020 
ctgggtagac ctagactggc tcatagacat ctcctgccag atcacggagc tcgacctttc 1080 
tgccaactgc ctggcgaccc tcccctcggt tatcccctgg ggcctcatca atctccggaa 1140 
gctgaacctc tccgacaacc acctggggga gctgcctggc gtgcagtcat cggacgaaat 1200 
catctgttcc aggctacttg aaattgacat ttccagcaac aagttgtccc acctccctcc 1260 
tggattcttg cacctctcaa aacttcaaaa actgacagct tcaaaaaatt gtttagaaaa 1320 
attgttcgaa gaagaaaatg ccactaactg gataggttta cggaagctac aggaacttga 1380 
tatatctgac aataaattga cagaactccc tgccctgttc cttcactctt tcaagtccct 1440 
caattctctg aatgtctcca gaaacaacct gaaggtgttt ccagatccct gggcctgccc 1500 
tttgaaatgt tgtaaagctt ccagaaatgc cctggaatgt ctgccagaca aaatggctgt 1560 
cttttggaaa aatcacctga aggatgtgga tttctcagaa aacgcactca aagaagttcc 1620 
cctgggactt ttccagcttg atgccctcat gttcttgagg ttacagggga accagctggc 1680 
ggcacttcca cctcaagaga agtggacctg caggcagctc aaaaccctgg atctctccag 1740 
aaaccaactt ggcaaaaatg aagatggact gaaaacgaag cgtattgcct ttttcaccac 1800 
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cagaggtcgc cagcgctccg ggactgaggc agagacaact atggagttca gtgcatctct 1860 

ggtaaccatt gtgttcctgt ctaacaactg taacctctgt gcatacacat gtgcagcaag 1920 

tgtgctggaa tttccggcct tcctaagtga gtctttggaa gtcctttgcc tgaacgacaa 1980 

ccacctcgac acagtccctc cctcggtttg cctactgaag agcttatcag agctctactt 2040 

gggaaacaac cctggcctcc gggagctccc tcctgagctg gggcagctgg gcaacctctg 2100 

gcagctggac actgaagacc tgaccatcag caatgtgcct gcagaaatcc aaaaagaagg 2160 

ccccaaagca atgctgtctt acctgcgtgc tcagctgcgg aaagcggaaa agtgcaagct 2220 

gatgaagatg atcatcgtgg gtcccccgcg ccagggcaag tccaccctcc tggagatctt 2280 

acagacgggg agggcccccc aggtggtgca tggagaggcc aqcatcagga ccaccaagtg 2340 

ggagctccag aggccggctg gctcgagagc caaggtcaag gatggtctgc gtgcagagtc 2400 

cctgtgggtt gagtccgtgg agttcaacgt ctgggacatc gggggaccgg ccagcatggc 2460 

cactgtcaac cagtgcttct tcacggacaa ggccctgtac gtggtggtct ggaacctggc 2520 

gctgggggag gaggccgtgg ccaacctcca gttctggctg ctcaacatcg aggccaaggc 2580 

cccaaacgcc gtggtgctgg tggtcgggac gcacctggat ttaattgaag ccaagttccg 2640 

tgtggaaagg attgcaacgc tgcgtgccta tgtgctggca ctctgccgct ccccctccgg 2700 

ctccagggcc acaggcttcc cagacatcac cttcaaacac ttacatgaga tttcctgcaa 2760 

gagcctggaa ggtcaggaag ggctgcgaca gctgattttc cacgtcacgt gcagcatgaa 2820 

ggacgtcggc agcaccatcg gctgccagcg actggcaggg cggctgatcc ccaggagcta 2880 

cctgagcctg caggaggccg tgctggcaga gcagcagcgc cgcagccggg acgacgacgt 2940 

gcagtacctg acggacaggc agctggagca gctggtggag cagacgcccg acaacgacat 3000 

caaggactac gaggacctgc agtcagccat cagcttcctc atagaaaccg gcaccctgct 3060 

ccatttcccg gacaccagcc acggcctgag gaacctctac ttcctcgacc ctatttggct 3120 

ctccgaatgt ctgcagagga tctttaatat taagggctct cggtcagtgg ccaagaatgg 3180 

ggtgatcaga gcagaagacc tcaggatgct gctggtgggg actggcttca cgcagcagac 3240 

ggaagagcag tacttccagt tcctggccaa gtttgagatc gccctgcccg tcgccaatga 3300 

cagctacctc ctgccccatc tccttccatc taaacctggc ctggacaccc acggtatgcg 3360 

gcaccccaca gccaacacca ttcagagggt atttaagatg agcttcgttc ccgttggctt 3420 

ctggcaaagg tttatagcac ggatgctgat cagcctggcg gagatggacc tgcagctttt 3480 

tgaaaacaag aagaatacta aaagcaggaa caggaaagtc accatttaca gttttacagg 3540 

aaaccagaga aatcgctgta gcacattcag agtgaaaaga aatcagacca tctattggca 3600 

ggaagggctc ctggtcactt ttgatggggg ctacctcagt gtggaatctt ccgacgtgaa 3660 

ctggaaaaag aagaaaagcg gaggaatgaa aattgtttgc caatcagaag tgagggactt 3720 

ctcagccatg gctttcatca cggaccacgt caattccttg attgatcagt ggtttcccgc 3780 

cctgacagcc acagagagcg acgggacgcc actcatggag cagtacgtgc cctgcccggt 3840 

ctgcgagaca gcctgggccc agcacacgga ccccagtgag aaatcagagg atgtgcagta 3900 

cttcgacatg gaagactgtg tcctgacggc catcgagcgg gacttcatct cctgccccag 3960 

acacccggac ctccccgtgc cgctgcagga gctggtccct gaactgttca tgaccgactt 4020 

cccggccagg ctcttcctgg agaacagcaa gctggagcac agcgaggacg agggcagcgt 4080 

cctgggccag ggcggcagtg gcaccgtcat ctaccgggcc cggtaccagg gccagcctgt 4140 

ggccgtcaag cgcttccaca tcaaaaaatt caagaacttt gctaacgtac cggcagacac 4200 

catgctgagg cacctgcggg ccaccgatgc catgaagaac ttctccgagt tccggcagga 4260 

ggccagcatg ctgcacgcgc tgcagcaccc ctgcatcgtg gcgctcatcg gcatcagcat 4320 

ccacccgctc tgcttcgccc tggagctcgc gccgctcagc agcctcaaca ccgtgctgtc 4380 

cgagaacgcc agagattctt: cctttatacc cctgggacac atgctcaccc aaaaaatagc 4440 

ctaccagatc gcctcgggcc tggcctacct: gcacaagaaa aacatcatct tctgtgacct 4500 

gaagtcggac aacattctgg tgtggtccct tgacgtcaag gagcacatca acatcaagct 4560 

atctgactac gggatttcga ggcagtcatt ccatgagggc gccctaggcg tggagggcac 4620 

tcctggctac caggccccag agatcaggcc tcgcattgta tatgatgaga aggtagatat 4680 

gttctcctat ggaatggtgc tctacgagtt gctgtcagga cagcgccctg cactgggcca 4740 

ccaccagctc cagattgcca agaagctgtc caagggcatc cgcccggttc tggggcagcc 4800 

ggaggaagtg cagttccggc gactgcaggc gctcatgatg gagtgctggg acactaagcc 4860 

agagaagcga ccgctggccc tgtcggtggt gagccagatg aaggacccga cttttgccac 4920 

cttcatgtat gaactgtgct gtgggaagca gacagccttc ttctcatccc agggccagga 4980 

gtacaccgtg gtgttttggg atggaaaaga ggagtccagg aactacacgg tggtgaacac 5040 

agagaagggc ctcatggagg tgcagaggat gtgctgccct gggatgaagg tgagctgcca 5100 

gctccaggtc cagagatccc tgtggacagc caccgagaat tcctacctgg tcttagcggg 5160 
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cctcgccgat gggcttgtgg ctgtgtttcc 
ctcctacctg tgctcacaca cagccaacag 
acggcagaac ccctacccag tgaaggccat 
gtacagcaat gggccgggcc tccttgtcat 
gctggagccc tacatggccc cctccatggt 
aggggaggag gtcgtctggt gcctggatga 
caccacctac cagctgtgtg cccggtactt 
gtttcccgtg cggcccttgg acacggaacc 
ggtgcctgag ggggactcca tcgcggacgt 
gcagatcctg atccaccagg aatcactcac 
atccccaccc cgccaggctg ccaggtcccc 
ttccagtgtg cctttctcca ccgactgcga 
tgcctccgac aggtctgagc atgacctgac 
cctgcaggcc gtgaagatcc tcgccgtcag 
agatgttatc gtcattggcc tggagaagga 
cgtcttaaaa gcccgagagc tgactccgca 
aaaggacact gttgtgtgca cctttgaaaa 
gaggggctgg ggcgccaggg agttcgacat 
gctggaggct tgcactcgca agagaaggta 
gctggctggc ccggggctgc agcctgactc 
cctagaagac agatggagtt ctcccctgaa 
actcgcctgg cggtggctcc agggttctct 
ccatccccca actgctgatt ttacagcccc 
gcctcctctg gcctccccca tcagtttgca 
tgggtcaaac aaagcagggc cgggccttcc 
ctagaggccc tccgctggga agcacttgag 
cctttccccg ccagagacct caggctctca 
gcctgggcca gcttgggcaa gccaagatca 
tgggaaagcc cttgggggat cccgggtgag 
ttcctttaag tgccaccgcc agcaagccca 
cctttacatt cctctccacc gacaaaagga 
agggccttgt gtttatagct ttgtcaagta 
tttgttgcac ata 

<210> 31 
<211> 1800 
<212> DNA 

<213> Honio sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 6035509CB1 

<400> 31 

gctgcagagt gctttacttt caacaagatg 
cagtgacaca gtcttggctc actgtaacct 
ctcagcctcc tgagtagctg ggattacagg 
cctatgatgc ttggacttga atcacttcca 
accattggta aaggcaccta tggcaaagtc 
ctggctgcag tgaaaattct ggatccagtc 
tacaacattt tgcagttcct tcctaatcat 
tacaaagcgg atcactgtgt agggggacag 
ggctcagtca ctgagcttgt caaaggtcta 
atgatctcat: acatcttgta cggggccctc 
atcatccacc gtgatgtgaa ggggaataac 
ctcgttgact ttggtgtttc agctcaactc 



cgtggtgcgg ggcaccccaa aggacagctg 5220 
gtccaagttc agcatcgcgg atgaagacgc 5280 
ggaggtggtc aacagcggct ctgaggtctg 5340 
cgactgtgcc tccctggaga tctgcaggcg 5400 
tacgtcagtc gtgtgcagct ctgagggcag 5460 
caaggccaac tccttggtga tgtaccactc 5520 
ctgcggggtc cccagccccc tcagggacat 5580 
cccggcagcc agccacacgg ccaacccaaa 5640 
gagcatcatg tacagtgagg agctgggcac 5700 
tgactactgc tccatgtcct cctactcctc 5760 
ctcaagcctc cccagctccc cagcaagttc 5820 
ggactcagac atgctacata cgcccggtgc 5880 
ccccatggac ggggagacct tcagccagca 5940 
agacctcatt tgggtcccca ggcgcggtgg 6000 
ttctggcgcc cagcggggcc gagtcattgc 6060 
tggggtgctg gtggatgctg ccgtggtggc 6120 
tgaaaacaca gagtggtgcc tggccgtctg 6180 
tttctaccag tcctacgagg agctgggccg 6240 
attcctgtgg aatgactgtc acacatcaga 6300 
ctctgccatc ggcctctagt tctccaagga 6360 
ctccttgctg ctaagaagtg ctgagaagtt 6420 
ggttctctgg agcagagttc tctgaatacc 6480 
agggaagaca gtggtatcag gctgggagcg 6540 
ggagcagggg tgcaggatcc tgttctgagc 6600 
tgccatcccc aggtctcaga tggaattaca 6660 
gtagggcagg aggggggctg tgacccctgc 6720 
gcacattcca caggctcctg agtccccgag 6780 
gatgtctctg tgttcgggaa ggtctccgtg 6840 
gagtgttgcc ccatccagag aatgaatgag 6900 
gaggcacaca ttctgagtgc acccgcttag 6960 
aggggaaact caatcagcag gacttcagaa 7020 
aatttggacg cagctggaaa cacaggcctg 7080 

7093 



gagtcttgct ctgtttccca gcctgtagtg 60 
ctgcctcctg ggttcaagtg attctcctgc 120 
aaacatctgt atggattatt tcactataat 180 
gatcccacag acacctggga aattatagag 240 
tacaaggtaa ctaacaagag agatgggagc 300 
agtgatatgg atgaagaaat tgaggcagaa 360 
cccaatgttg taaagtttta tgggatgttt 420 
ctgtggctgg tcctggagct gtgtaatggg 480 
ctcagatgtg gccagcggtt ggatgaagca 540 
ttgggccttc agcatttgca caacaaccga 600 
attcttctga caacagaagg aggagttaag 660 
accagtacac gtctgcggag aaacacatct 720 
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gttggcaccc cgttctggat ggcccctgag gtcattgcct gtgagcagca gtatgactct 780 
tcctatgacg ctcgctgtga cgtctggtcc ttggggatca cagctattga actgggggat 840 
ggagaccctc ccctctttga catgcatcct gtgaaaacac tctttaagat tccaagaaat 900 
cctccaccta ctttacttca tccagaaaaa tggtgtgaag aattcaacca ctttatttca 960 
cagtgtctta ttaaggattt tgaaaggcga ccttccgtca cacatctcct tgaccaccca 1020 
tttattaaag gagtacatgg aaaagttctg tttctgcaaa aacagctggc caaggttctc 1080 
caagaccaga agcatcaaaa tcctgttgct aaaaccaggc atgagaggat gcataccaga 1140 
agaccttatc atgtggaaga tgctgaaaaa tactgccttg aggatgattt ggtcaaccta 1200 
gaggttctgg atgaggtact aaatatttag tagacaattc tcattgaaga catttgtttc 1260 
atgtgaatgg tctgaacttt ctgttgtaga ccatgtcctc ctaaggtcat ttgaaaattt 1320 
aattgtttgt gtagctatgg gatgaagttc agggagcatt cagttgctgt gactatgatc 1380 
ctgtgctgtg tttatttaga tagcccctag aatgatgaag agaaaaggat ttggattttt 1440 
gcaataaagc tctttatatt gtagccttaa tgatggatta tatcagctga aaatattttg 1500 
tttgataaaa tttgataaaa tatttcaatt aacccttaag aagttgtttg ttcttcataa 1560 
gaaagagctt catttaggga aatagtgaag ttaatatagc ttgaattcta aatttgaagt 1620 
ctgtgataat ccccaCttaa aatatgcatg tttaatagag ctgttaattg cactggacct 1680 
gtttatgctg agtctaactc tggggattgt taccttcaat gtctaaatca ctaaagtgta 1740 
atacaaagtg gttaattctg tatttatgcc acctaggttt taagtgcagt gctttgagaa 1800 

<210> 32 

<211> 6347 . 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7373485CB1 

<400> 32 

ggaagcgaga agccgcatca accatgtaag cagcttcgct tcctgccgca accgtccgcg 60 
gcctgaggag cccaccgccg ctctcggggg ccgacttccg ggggctgagc cgttgaagcg 120 
gaggctgggg cggggggcag ccggcgcggc cggggcagga ggcgcagact catgaaatgg 180 
ccacagatga taagacgtcc ccaacactgg actctgctaa tgatttgcct cgatctccta 240 
ctagtccttc tcatctcaca cactttaaac ctttgactcc tgatcaagat gagccccctt 300 
ttaaatcagc ttatagttct tttgtaaatc tctttcgttt taacaaagag agagcagaag 360 
gaggccaggg agaacagcag cctttgagtg gaagttggac cagccctcag ctcccttcga 420 
ggacacagtc tgttaggtca cccacacctt ataaaaagca gcttaatgag gaactccagc 480 
ggcgctcttc agcattagac acaagaagga aagcagaacc tacctttgga ggtcatgacc 540 
ctcgtacagc tgttcagctt cgaagcctca gcacagtatt aaaacgcctc aaggaaatca 600 
tggaggggaa aagccaggat agtgacctga aacaatactg gatgccagat agccaatgta 660 
aagagtgcta tgactgtagt gagaaattta caacctttag gcgcagacac cattgccgac 720 
tatgtgggca gattttctgc agtcgttgct gtaatcaaga aatccctgga aaatttatgg 780 
gctatacagg agacctccga gcttgcacat attgtagaaa aatagcctta agttatgctc 840 
attccacaga cagtaattct attggggaag acttgaatgc tctttcagat tctgcttgct 900 
ctgtgtctgt gctitgatcca agtgaacccc gaacacctgt tgggagtagg aaagccagcc 960 
gtaacatatt tttagaggat gatttggcct ggcaaagttt gattcatcca gattcctcaa 1020 
atactcctct ttcaacaaga cttgtatctg tgcaagagga tgctgggaaa tctcctgctc 1080 
gaaatagatc agccagcatt actaacctgt cactggatag atctggttct cctatggtac 1140 
cttcatatga gacatctgtc agtccccagg ctaaccgaac atatgttagg acagagacca 1200 
ctgaggatga acgcaaaatt cttctggaca gtgtgcagtt aaaagacctg tggaaaaaaa 1260 
tctgccatca cagcagtgga atggagtttc aggatcaccg ctactggttg agaacgcatc 1320 
ccaactgcat tgtaggaaag gaattagtca actggctaat ccgaaatggg catattgcca 1380 
caagggcaca agctatagca attggacaag caatggttga tggacgttgg ctggattgtg 1440 
ttagtcatca cgaccagctt ttcagagatg agtatgcgct gtatagacca ctgcagagta 1500 
cagaattttc tgagacgcct tctcccgaca gtgactcagt gaactccgtg gaaggacact 1560 
ctgagccatc ctggtttaaa gacataaagt ttgatgacag tgacacagaa cagatagctg 1620 
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aagaaggtga cgataatttg gctaagtatt 
caataagtga cgctttcatc aaagaatcct 
aagagctgcc tttcacacct ttgggctggc 
agaatgggga gaaacaagcc atggagaggt 
cactactcca gcagttgctc catagtgact 
tgtcattggt ctgccaggtt gttcagacag 
acatggatat ccgtcagttt gtccacatca 
ctgtggttgt caatggcttt gtttgtacca 
gtattaaaaa ccctaaaatt cttctgttga 
aaactaagtt tacttgcatt gatcctattg 
atgtccagcg aatagttgat gttcgaccca 
ggattgccca ggacatgtta ttggaacatg 
aagttttgga acgaatcagt cgaatgaccc 
tgcttacgaa accacgcctg ggcacttgtc 
ctaatgaaca aaccaagaca ctgatgtttt 
caatcaagct aagaggaggc tctgattatg 
ttatgatctg tgttgcttat cattctcaac 
ctatgcctcc cacattaatg caaaaccctt 
atgagggggc tgtccaagag cagtacggtg 
ctcctgagtc tctgccctgt gatgatagca 
agggtgagca ggaaaataaa aatcttccgc 
atagcacaac agcttgcccg gcgggtctcc 
cattgttgcc actccctgtg gatgaccaac 
gtittgcagca aacagttgtg ctgcaggatc 
ctctacagga tgacactgga ttatatgtta 
gaaagactta ttctttggcc tttaagcagg 
cagtaatcac attccgagaa ccctttcttt 
gagattattt tgcagagcag gtttactggt 
tggagaacag gaggaagaaa cage tgc tea 
gaagtattca ggccaagtict: attcaagtct 
ttgctgagca tctgggcgat agccagagct 
gaggaggaag aattcagccc aaaaattcag 
gtacttcaag tggcaaatca ggaagcaaaa 
tgagtgatgc tgtgtggtca acaaaggtgg 
tttgtgtgct cttcagcagc tcttctgccc 
gtccttggat tgtaacaatg gaattttatg 
tagagagata ctgtttcagg ccttcttatc 
tggtacatca tattcggcgc tttgttcatg 
agttggattc tccagtacct ggatatcagc 
tctgcaaaca ggtaacacca gttgttgctc 
caaaatacct tgaacttagg ttttatgggc 
cctgtggtca ctccatccat catgattatc 
cgtctttcag ttattctccc attcggcttc 
tcattaagcg tcaggcccca ttaaaagtgt 
aaaaagtttc acaggtatat gttgccattg 
catttagtaa aacaagagag gaaaaaatgg 
aaggtgagtt caagaactgg attgagaaga 
atacccctca gcaactgcag tcggtctttg 
gtgaagtgct gcaagcttgg aataacaggt 
gaaagagacc ttcagttcct ccaagtcctg 
taagtgcgat ggatgcatct ccacggaata 
aggatcgctt cttaacaact ttgtccagcc 
tgcctacgcc acctgaagtc atgtctgaac 
cagccagcag ttccgaagat gtgtttgatg 
tgaaggaaaa gtcaaccatg aaagccatct 
atcctattcc atttcctttt gatccagata 



tgatttctga cactggagga caacagctct 1680 
tatttaatcg ccgagtagag gaaaaatcca 1740 
atcataacaa cctggagctc ctgagggagg 1800 
tgctttcagc taatcataac cacatgatgg 1860 
cactgtcatc atcttggagg gacatcatcg 1920 
tccgacctga tgtcaagaac caggatgatg 1980 
aaaaaatccc aggtggaaag aagtttgatt 2040 
agaacattgc acataaaaag atgaattctt 2100 
agtgttccat tgagtatctc tacagagaag 2160 
tgcttcagga aagggaattc ttgaagaatt 2220 
ccttggttct tgttigagaaa acagtgtctc 2280 
gcattacttt ggtcattaat gtaaagtcac 2340 
aaggtgattt agtgatgtca atggaccagc 2400 
acaaatttta tatgcagata tttcagttgc 2460 
ttgaaggttg tccacagcac ctaggctgta 2520 
agctggctcg agttaaggag atcctaatat 2580 
tagaaatatc ctttctcatg gatgaatttg 2640 
cattccattc cctgattgag ggacgagggc 2700 
gaggttccat cccctgggat cctgacatcc 2760 
gtttgctgga atcgaggatt gtgtttgaga 2820 
aggctgttgc ctctgtgaag catcaagaac 2880 
cttgtgcttt ctttgcacct gtaccggaat 2940 
aagatgcttt aggcagcgag ctgccagaga 3000 
ccaaaagcca gataagagcc tttagagacc 3060 
ctgaggaagt cacctcctct gaagataaac 3120 
aattaaaaga tgtgatcctc tgtatctccc 3180 
taactgaaaa ggggatgaga tgctctaccc 3240 
ctcctctcct caataaagaa ttcaaagaaa 3300 
gggatctctc tggacttcag ggcatgaatg 3360 
taccctcaca tgagctagtg agcactagaa 3420 
tgggtagaat gctggccgat tatcgagcca 3480 
acccttttgc tcattcaaag gatgcatcaa 3540 
acgagggtga tgaagagaga gggcttattc 3600 
actgtctgaa tcccattaat caccagagac 3660 
agtccagcaa tgctcctagt gcctgtgtca 3720 
gaaagaatga tcttacatta ggaatatttt 3780 
agtgtccaag catgttctgt gataccccca 3840 
gccaaggctg tgtgcagata atcctgaagg 3900 
atacaattct tacatattcc tggtgtagaa 3960 
tttccaatga gtcctggtct atgtcatttg 4020 
accagtatac tcgcagagcc aacgctgagc 4080 
accagtattt ctcctataac cagatggtgg 4140 
ttgaagtatg tgttccactc cccaaaatat 4200 
cccttcttca ggatctgaag gacttctttc 4260 
atgaaagact tgcatctttg aaaactgata 4320 
aagatatttt tgcacagaaa gagatggaag 4380 
tgcaagcaag gctcatgtct tcctctgtag 4440 
agtcactcat tgccaagaaa caaagtctct 4500 
tgcaggacct tttccaacag gaaaagggta 4560 
gaagactgag acaaggggaa gaaagcaaga 4620 
tttctccagg acttcagaat ggagaaaaag 4680 
agagctccac cagttctact catctccaat 4740 
agtcagtggg agggccccct gagctagata 4800 
ggcatttgct gggatccaca gacagccaag 4860 
ttgcaaattt gcttccagga aatagctata 4920 
aacactactt aatgtatgaa catgaacgag 4980 
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tgcccattgc agtctgcgag aaggaaccca gctccatcat tgcttttgct ctcagttgta 5040. 
aagaataccg aaatgcctta gaggaattgt ctaaagcgac tcagtggaac agtgccgaag 5100 
aagggcttcc aacaaatagt acttcagata gcagaccaaa gagtagcagc cctatcagat 5160 
tacctgaaat gagtggagga cagacaaatc gtacaacaga aacagaacca caaccaacca 5220 
aaaaggcttc tggaatgctg tccttcttca gagggacagc agggaaaagc cccgatctct 5280 
cttcccagaa gagagagacc ttacgtggag cagatagtgc ttactaccag gttgggcaga 5340 
caggcaagga ggggaccgag aatcaaggcg ttgagcctca agatgaagta gatggaggag 5400 
atacgcaaaa gaagcaactc ataaatcctc atgtggaact tcaattttca gatgctaatg 5460 
ccaagtttta ctgtcggctc tactatgcgg gagagtttca taagatgcgt gaagtgattc 5520 
tggacagcag tgaggaagat ttcattcgtt ccctctccca ctcatcaccc tggcaggccc 5580 
ggggaggcaa atcaggagct gccttctatg caactgagga tgatagattt attttgaagc 5640 
aaatgcctcg tctggaagtc cagtccttcc tcgactttgc accacattac ttcaattata 5700 
ttacaaatgc tgttcaacaa aagaggccca cggcgttggc caaaattctt ggagtttaca 5760 
gaattggtta taagaactct cagaacaaca ctgagaagaa gttagatctc cttgtcatgg 5820 
aaaatctttt ctacgggaga aagatggcac aggtttttga tttgaagggc tctcttagga 5880 
atcggaatgt aaaaactgac actggaaaag agagttgtga tgtggtcctg ctagatgaaa 5940 
atctcctaaa gatggttcga gacaaccctc tatatattcg ttctcattcc aaagctgtgc 6000 
tgagaacctc gatccatagt gactcccatt tcctttctag ccacctcatt atagattatt 6060 
ctttgctggt tgggcgagat gatactagca atgagctagt agttggaatt atagattata 6120 
ttcgaacatt tacatgggac aaaaagcttg agatggttgt gaaatcaaca ggaattttag 6180 
gtggacaagg taaaatgcca acagtggtgt ctccggagtt gtacaggact aggttttgtg 6240 
aggcaatgga caagtatttc ctaatggtac cagaccactg gacaggcttg ggtctgaatt 6300 
gctgaaatca agacatattt gaaatggact gtgaggaaaa ggggaac 6347 

<210> 33 
<211> 1876 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5734965CB1 

<400> 33 

tggggttcgg cgcggctacg 
aagtactgtg gaggaactgc 
ggtgtggacg ttggaacagg 
ttggcttttg cagaccagcc 
tcctccgagg acatctgggc 
gatttaaacc aaattcgagg 
aagcagtttc acccattacc 
tggctggacc atcgagcagt 
ctccagtacg tcgggggggt 
aaagagaact tgagagagat 
ttcttatcgt ggaaggcaac 
tggacatatt cagcagagaa 
gactttgttg cagataatta 
cttggaaatg ggctcacacc 
gtcgcagctt cactcattga 
agagggcacg gcctcatctg 
ggaacgtctt cttgtcacat 
gggccttatt tctcagccat 
actggaaaat tgatagacca 
aaggccacag ccagatgcca 
aagaaggctc agcctgtggg 
ggcaaccggt ctcccttagc 
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tgcagaatcc gtctagctaa 
aatgtctggt ggagaacaga 
cagtgtccgt gcagctctgg 
aattaagaat tgggagcccc 
tgcgtgctgt gttgtcacaa 
acttgggttt gatgccacgt 
agtcaaccag gaaggggatt 
cagtcaagtt aacaggatca 
gatgtctgtg gaaatgcagg 
ttgctgggat aaggcgggac 
a^gtgtcaca gcacggtctc 
aggctgggac gacagtttct 
cagcaaaata ggaaaccaag 
agaggcagca agagaccttg 
tgcccatgca ggaggactag 
tgaggggcag ccagtgacgt 
ggggatcagc aaagacccga 
ggtacctggg ttctggctga 
catggtacaa ggccatgctg 
gagtatatat gcatatttga 
tttccttact gttgatttac 
agatctgaca ctaaagggca 



aatgtaattt cagattggac 60 
aaccagagag gtactatgtg 120 
tggaccagag tggggtcctg 180 
agttcaacca ccatgagcag 240 
agaaagttgt acaagggatt 300 
gttctctggt tgttttggat 360 
cccatcgaaa cgtcatcatg 420 
atgagaccaa gcacagtgtc 480 
ccccgaaact tctgtggctg 540 
atttctttga tctcccggac 600 
tctgctccct ggtgtgtaag 660 
ggaaaatgat tggtttggaa 720 
tgctacctcc tggagcttct 780 
gccttctccc tgggattgcg 840 
gagtgattgg ggcagatgtg 900 
cacggctggc tgtcatctgt 960 
tttttgtacc aggcgtctgg 1020 
atgaaggtgg tcagagcgtt 1080 
cttttccaga actacaagta 1140 
acagtcacct ggatctgatt 1200 
atgtttggcc agattCccat 1260 
tggtcaccgg attgaaactg 1320 
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tctcaggacc ttgatgatct tgccattctc tacctggcca cagttcaagc cattgctttg 1380 
gggactcgct tcattataga agccatggag gcagcagggc actcaatcag tactcttttc 1440 
ctatgtggag gcctcagcaa gaatcccctt tttgtgcaaa tgcatgcgga cattactggc 1500 
atgcctgtgg tcctgtcgca agaggtggag tccgttcttg tgggtgctgc tgttctgggt 1560 
gcctgtgcct caggggattt cgcttctgta caggaagcaa tggcaaaaat gagcaaagtt 1620 
gggaaagttg tgttcccgag actacaggat aaaaaatact atgataagaa ataccaagta 1680 
ttcctgaagc tggttgaaca ccagaaggag tatttggcga tcatgaatga tgactgaaca 1740 
gggcttgcag gtgctgatgc cagaagcttc tgtgccattg cattaaagac ttctgtcatt 1800 
tgatccatgt tcaagaccct tgaggtattg tttcatcatt tctgtattgt ctttcaataa 1860 
agaatacaaa catgtg 1876 

<210> 34 

<211> 1487 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7473788CB1 



<400> 34 

atgaggagtg gcgccgaacg caggggcagc 
cccggccgcg cgcgccccgc cggctccgac 
ggccagcccc gggcccggga cticgggcgat 
tggagcaagt ggaagaagag gatgggctcg 
gtgtttgacg acaaggagga cgtgaacttc 
aagggcagct ttggcaaggt gtgcattgtg 
atgaagtaca tgaacaagca gcagtgcatc 
gagctggaga tcctgcagga gatcgagcac 
caggacgagg aggacatgtt catggtcgtg 
cacctgcagc agaacgtgca gttctccgag 
gcactggctc tggactacct gcgcggccag 
aacattctcc tggatgagag aggacatgca 
atcaaggacg gggagcgggc gacggcatta 
atcttccact cttttgtcaa cggcgggacc 
gtgggggtga tggcctatga gctgctgcga 
aacgccgtgg agtccctggt gcagctgt:tc 
tggtccaagg agatggtggc cttgctgcgg 
ctctccagcc tccaggacgt gcaggcagcc 
ctgagcgaga agagggtgga gccgggcttc 
cccacctttg agctggagga gatgatcctg 
cgcctggcca agaacaagtc ccgggacaac 
tatcttcaag actgcctcga tgccatccag 
ctgaagagga gccaggacct cccgagggag 
cggagcctgt ggaggacgag gcggacgctc 
tcggccggga gcggctaggc cgggacgccc 



agcgccgcgg cgtccccggg ctcgccgccc 60 
gcgccctcgg ccctgccgcc gcccgctgct 120 
gtccgctcgc agccgcgccc cctgtttcag 180 
tccatgtcgg cggccaccgc gcggaggccg 240 
gaccacttcc agatccttcg ggccattggg 300 
cagaagcggg acacggagaa gatgtacgcc 360 
gagcgcgacg aggtccgcaa cgtcttccgg 420 
gtcttcctgg tgaacctctg gtactccttc 480 
gacctgctac tgggcgggga cctgcgctac 540 
gacacggtga ggctgtacat ctgcgagatg 600 
cacatcatcc acagagatgt caagcctgac 660 
cacctgaccg acttcaacat tgccaccatc 720 
gcaggcacca agccgtacat: ggctccggag 780 
ggctactcct tcgaggtgga ctggtggtcg 840 
ggatggaggc cctatgacat ccactccagc 900 
agcaccgtga gcgtccagta tgtccccacg 960 
aagctcctca ctgtgaaccc cgagcaccgg 1020 
ccggcgctgg ccggcgtgct gtgggaccac 1080 
gtgcccaaca aaggccgtct gcactgcgac 1140 
gagtccaggc ccctgcacaa gaagaagaag 1200 
agcagggaca gctcccagtc cgagaatgac 1260 
caagacttcg tgatttttaa cagagaaaag 1320 
cctctccccg ccctgagtcc agggatgctg 1380 
cgcctgccca tgtgcggccc catttgcccc 1440 
gtggtcctca ccccttg 1487 



<210> 35 

<211> 1884 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc.feature 

<223> Incyte ID No: 3107989CB1 
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<400> 35 

9aggtgacca attttctctc caaaagagaa aggaagttga ttaaaaaaag aatccatgct 60 
ccaaagcggc agccaaatcc atctatggcc cccaatgcat cacccagaaa ggggttccag 120 
actctcctgc aaaaggccaa ctctacttcc cggctcccac ttcccctcct tcgccacagg 180 
agggtggcga aggatttata acccacctct ttctttcagt tgccatggag acaagcccca 240 
gtcctttcat tccttctggt acctctctct ccaacgcagg cggaaaggag gcggcttagc 300 
ccaaacatgc tgggggaggg gctggcggcc tcgacggcag ctgcggaact aggccgaggg 360 
acaaaggcta agtttttcca tggtttggac tggatatcgg tggaactctg gtcaagctgg 420 
tatattttga acccaaagac atcactgctg aagaagaaga ggaagaagtg gaaagtctta 480 
aaagcattcg gaagtacctg acctccaatg tggcttatgg gtctacaggc attcgggacg 540 
tgcacctcga gctgaaggac ctgactctgt gtggacgcaa aggcaatctg cactttatac 600 
gctttcccac tcatgacatg cctgctttta ttcaaatggg cagagataaa aacttctcga 660 
gtctccacac tgtcttttgt gccactggag gtggagcgta caaatttgag caggattttc 720 
tcacaatagg tgatcttcag ctttgcaaac tggatgaact agattgcttg atcaaaggaa 780 
ttttatacat tgactcagtc ggattcaatg gacggtcaca gtgctattac tttgaaaacc 840 
ctgctgattc tgaaaagtgt cagaagttac catttgattt gaaaaatccg tatcctctgc 900 
ttctggtgaa cattggctca ggggttagca tcttagcagt atattccaaa gataattaca 960 
aacgggtcac aggtactagt cttggaggag gaactttttt tggtctctgc tgtcttctta 1020 
ctggctgtac cacttttgaa gaagctcttg aaatggcatc tcgtggagat agcaccaaag 1080 
tggataaact agtacgagat atttatggag gggactatga gaggtttgga ctgccaggct 1140 
gggctgtggc ttcaagcttt ggaaacatga tgagcaagga aaagcgagat tccatcagca 1200 
aggaagacct cgcccgggcc acattggtca ccatcaccaa caacattggc tccattgctc 1260 
ggatgtgtgc gttgaatgag aacatagaca gagttgtgtt tgttggaaat tttctcagaa 1320 
tcaatatggt ctccatgaag ctgctggcat atgccatgga tttttggtcc aaaggacaac 1380 
tgaaagctct gtttttggaa catgagggtt attttggagc cgttggggca ctgttggaac 1440 
tgttcaaaat gactgatgac aagtagagac gagcagtgga ggaaacagcc tcccaaaagg 1500 
acagagaact aaaaaattgc tgctggagaa ggtgaaagtc gctttgggac ggaagccaag 1560 
ccattatggc agatgaacct gctggatttg taaataattt aaaatccttc cagatgatct 1620 
tttactctta ggttttgagc taatgattca aaacggggga atataaaagg ttttttttct 1680 
gtatactgta tttttttaaa aaaatggtgc agcgtggcca aacctaccaa ttgtatgcat 1740 
taactttgaa aagttgtttg atgtttaaga aggacctgat atgtaagcgc tggtcatttt 1800 
tcttctgggg tttactgatc agtgtggtga ttttaacttc atttagtaat tactctagga 1860 
gattttacct tgacttatat tttc 1884 

<210> 36 
<211> 1070 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^f eature 

<223> Incyte ID No: 7482887CB1 



<400> 36 

gcaaatcaca cagcatiggca gctcccagtc 
ctttaaaaac ctgggcattc cctccacaaa 
ttcctcttgc tggcacagat cataaagtct 
tttggcttct ttctacaagc aaggagcagc 
acttgagtta aatccgcaca acgaatctta 
gccaaggcgg gtggatcacc tgaggtcagg 
aacctatctc tactaaaaat acaaaaaatt 
tcagctactt gggaggctaa ggcaggagaa 
tgggccaaga tagcgccact gcactccagc 
aataataata ataatttctt ggctccaagt 
atcctcaggc catggccaac actgagagca 
gcctggtggg tgaaatcatc aaatactctg 



ctcctgcctc ttctgcattc cagacctgct 60 
ttgaagagtg gaattttttt tcacctgctc 120 
tgctctcttt ctatcacatc tcattattat 180 
aggccctttt acattaccat tagtgaaggc 240 
ctcttgcctg taatcccagc actttggaaa 300 
agttcgagac cagcctggcc aatgtggtaa 360 
ggccaggtgt ggtggtgggg gcctgtaatc 420 
ttgcttgaat ctgggagaca gaggttgcgg 480 
cttagcaaca agagcacaac tccatctcaa 540 
ctcagctccc gcaccacctg acactgtcag 600 
tcattatcaa tccgagtgct gttcagcaca 660 
agcagaaggg attctacctg gtgaccatga 720 
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agttccttcg ggcctctgag aaacccctga 
cattcttccc ggaccttgtg aagtacatga 
aggggctgaa tgtggcaaag acagggctaa 
ctatgctaga gactattatt cgcagggact 
gtggcagtga ttcattacaa agtgctgaaa 
aaccagttga ctacagatct tgtgcttatg 



agccgcacta cactaacctg aaagaccacc 780 
actcagggca ggttgtggcc atggtcctgg 840 
ggatgcttgg ggagaccaat tcattgggct 900 
tctgcgctaa aataggcggg aacgtcattg 960 
aagaaatcag cctatggttt aagcccaaag 1020 
actgggtcta tgcatgatag 1070 



<210> 37 

<211> 2890 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2963414CB1 



<400> 37 

gtgacccttc cctccccagg ccacggcagc 
gggctctggt gaacagcaaa tgctccacgc 
tggtgggcac tgagagacaa gattcctggg 
ggaggaagct ctgaggaccc cgctgagaac 
agcaataaag gccaagaagg agaaagttga 
agctgccctt tctacttcaa accttcctct 
ccatctgccg ccatggttgt gcaaaacagc 
cagctggagc ccttcctgca ccaggtcggg 
catacggtgt gcaagcccct cgtctcccgg 
gccatgaagc ggttcacccc acagtacaaa 
agcacaggcc atctcagctt ggttgccaac 
gtctccacag agtcggcggc ggtggccata 
aatggcagcg actgcaccct tgcccagtgg 
gagagcccgg ccaaggctct tctgaggtcc 
ctggtggaag acaccaacgg aaaccaggtt 
caatgccacc aggcccacct: gacccgcctg 
cggttcttgt tgctggaaaa tgtagtgtca 
aagatgggga cccggcagca cggcgatgat 
aggaagtgtg cgcagagcac ctcagcctgc 
tatcaaacag ataagaagta ctttctctgc 
gtggaggggt: tcagacaagc cctctatcag 
gagctcctgg agcccatcct gcaccagctc 
agttcatacc gcttctattc cagctctctc 
gaaagagccc caggcagccc gcatcctcac 
cccggtggtc tcaccaaggt tgacatccgc 
ggctactgga atgagcacac cacctacgat 
gaaaacctca tcaggatcct gcaggatatc 
tctggattct tctgggctat agatctcaaa 
gacacccctt: agatgtcttc ataatagtcc 
ggcagactat attaacagct gctgaacaaa 
gcattatgct ctaatgctgc tcatcggaga 
ggagtacact ggagggggca gcccaagtta 
ttgcatctaa aaatgtttac tcgttgccat 
aacttgactt gaagcagcct tgaagagtga 
tggactgctc gctccaaggg caggcagagc 
gttttaggtg acacaggatg aagcagaaga 
gcttctaagt ggacaaagcc aagcccaggc 
ctccattttt gcctgtggaa aggagggtcc 
ccaaaatctc ttaaaatgag gaatctctta 



ccggccctcc cgggcagacc tcccgcacca 60 
tgggacgggc cattgcctga tgcctgtaca 120 
ccctgccttc catacactcc ccacgatctc 180 
ccacagacag gaggacaact gcgctatgac 240 
ggaccgctga cagccccgtg tgctgttggg 300 
agcagactgt gcagggaccc cccaccacca 360 
gcagacgccg gggacatgag ggcaggcgtg 420 
gggcacatga gcgtgatgaa gtatgacgag 480 
gagcagaggt tctatgaatc cctgccgctg 540 
ggtaccgtca cagtgcacct ctggaaagac 600 
ccagtgaagg agagccagga gcccttcaag 660 
tggcagacgc tccagcagac caccggcagc 720 
ccgcatgccc agctggcacg ctcacccaag 780 
gagccccacc tcaacactcc agccttctcg 840 
gagaggaaga gcttcaaccc gtggggcctg 900 
tgctccgagt acccagagaa caagcggcat 960 
cagtacacgc atccctgtgt cctggatctg 1020 
gcatcggagg agaagaaggc ccgccacatg 1080 
ctgggtgtgc gcatctgcgg catgcaggtt 1140 
aaagacaagt actatggaag aaaactctca 1200 
ttcctacata atggaagcca cctccggagg 1260 
cgggccctcc tctccatcat taggagccag 1320 
cttgtcatct atgatgggca ggaaccacca 1380 
gaggctcccc aggcagccca cggtagctct 1440 
atgattgact ttgctcatac cacatacaag 1500 
ggaccagacc ctggctatat ttttggcctg 1560 
caagagggag aatgaaactt cctgggctta 1620 
tagagacctg ttggttgcta gggtagtcca 1680 
tatctacctt caaaaaccat ctctatatat 1740 
tcagctctgg aggtgattcc acatcccctg 1800 
acagacagcc aggataaagt ggcaccttct 1860 
gaggccagca ttgctgacat tctggaatat 1920 
gctgcagtcc gcacaagctg tgaggcagaa 1980 
gttcatgagc tcatggtttt tctccttgta 2040 
tcatgaatgc ctcttatctt cctaagcgga 2100 
gatctaccca tctcacctgc tctgcaccca 2160 
atgagctctg gcaaagcaag accccagatt 2220 
ctttacaggc ttttttttcc tttttttccc 2280 
gcagactttg gagttcccca ttctgccaca 2340 
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ttctgaccat gagacgcggc ttgcagtggg ggtgaacgca cataaaaagg gaccactgac 2400 
gtcctgctct actctctgct ttctatttat ttattttggg ggtgggttgg ggagtcagaa 2460 
gaacctggag gacggaggaa accaggggca atgtttacaa gactggtgga caagtgtaaa 2520 
tatggaataa gaacaaacag ttctaattaa ttccttcttc tgcagtacgg aaacctatta 2580 
caatgccctt gagtcaagca ctgagatacg ttacccaatt agggaaataa atttgttaat 2640 
aaaattgctg aggtcaccag tgattattgg tgtgccttat taccctttcc atttgtttat 2700 
tctgatcaca ctgtgtggta gttccaattt atgagcgact agcatatacc acaagaacag 2760 
ttcactgatt tcctacaatc cttcagggaa ctcgggtgga aatggtggct aataaaatat 2820 
ttgcatgtat ctgcaaggga ggcaccagac ctgagaagtg gtccttttat ttgaatctca 2880 
tacaatgtac 2890 

<210> 38 
<211> 5198 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_£eature 

<223> Incyte ID No: 7477139C31 

<400> 38 

cgacacggag cacccttcta gcttcttcgt ctccaggact gacgctcagg ctcctctctc 60 
gccttagccc aacttgcttt cccgcctcgc aaactccggt ttccctccac tcccaactct 120 
tttcactaca cgtttcccct cctcCatctc ccacgccacg aaccccgatc cccagactcc 180 
tctctcccgc cctcctcctt cctctctcct cccttcaact cttcatccgc ttccacctca' 240 
gactctgcgc gcacccaatt cagtcgcccg ctcccgttcg gctcctcgaa gccatggcgg 300 
gacctggggg ctggagggac agggaggtca cggatctggg ccacctgccg gatccaactg 360 
gaatattctc actagataaa accattggcc ttggtactta tggcagaatc tatttgggac 420 
ttcatgagaa gactggtgca tttacagctg ttaaagtgat gaacgctcgt aagacccctt 480 
tacctgaaat aggaaggcga gtgagagtga ataaatatca aaaatctgtt gggtggagat 540 
acagtgatga ggaagaggat ctcaggactg aactcaacct tctgaggaag tactctttcc 600 
acaaaaacat tgtgtccttc tatggagcat ttttcaagct gagtccccct ggtcagcggc 660 
accaactttg gatggtgatg gagttatgtg cagcaggttc ggtcactgat gtagtgagaa 720 
tgaccagtaa tcagagttta aaagaagatt ggattgctta tatctgccga gaaatccttc 780 
agggcttagc tcaccttcac gcacaccgag taattcaccg ggacatcaaa ggtcagaatg 840 
tgctgctgac tcataatgct gaagtaaaac tggttgattt tggagtgagt gcccaggtga 900 
gcagaactaa tggaagaagg aatagtttca ttgggacacc atactggatg gcacctgagg 960 
tgattgactg tgatgaggac ccaagacgct cctatgatta cagaagtgat gtgtggtctg 1020 
tgggaattac tgccattgaa atggctgaag gagcccctcc tctgtgtaac cttcaaccct 1080 
tggaagctct cttcgttatt ttgcgggaat ctgctcccac agtcaaatcc agcggatggt 1140 
cccgtaagtt ccacaatttc atggaaaagt gtacgataaa aaatttcctg tttcgtccta 1200 
cttctgcaaa catgcttcaa cacccatttg ttcgggatat aaaaaatgaa cgacatgttg 1260 
ttgagtcatt aacaaggcat cttactggaa tcattaaaaa aagacagaaa aaaggaatac 1320 
ctttgatctt tgaaagagaa gaagctatta aggaacagta caccgtgaga agattcagag 1380 
gaccctcttg cactcacgag cttctgagat tgccaaccag cagcagatgc agaccactta 1440 
gagtcctgca tggggaaccc tctcagccaa ggtggctacc tgatcgagaa gagccacagg 1500 
tccaggcact tcagcagcta cagggagcag ccagggtatt catgccactg caggctctgg 1560 
acagtgcacc taagcctcCa aaggggcagg ctcaggcacc tcaacgacta caaggggcag 1620 
ctcgggtgtt catgccacta caggctcagg tgaaggctaa agcctctaaa cctctacaaa 1680 
tgcagattaa ggcacctcca cgactacgga gggcagccag ggtgctcatg ccactacagg 1740 
cacaggttag ggcacctagg cttctgcagg tacagtccca ggtatccaaa aagcagcagg 1800 
cccagaccca gacatcagaa ccacaagatt tggaccaggt accagaggaa tttcagggtc 1860 
aagatcaggt acccgaacaa caaaggcagg gccaggcccc tgaacaacag cagaggcaca 1920 
accaggtgcc tgaacaagag ctggagcaga accaggcacc tgaacagcca gaggtacagg 1980 
aacaggctgc cgagcctgca caggcagaga ctgaggcaga ggaacctgag tcattacgag 2040 
taaatgccca ggtatttctg cccctgctat cacaagatca ccatgtgctg ttgccactac 2100 
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atttggatac tcaggtgctc attccagtag aggggcaaac tgaaggatca cctcaggcac 2160 
aggcttggac actagaaccc ccacaggcaa ttggctcagt tcaagcactg atagagggac 2220 
tatcaagaga cttgcttcgg gcaccaaact caaataactc aaagccactt ggtccgttgc 2280 
aaaccctgat ggaaaatctg tcatcaaata ggttttactc acaaccagaa caggcacggg 2340 
agaaaaaatc aaaagtttct actctgaggc aagcactggc aaaaagacta tcaccaaaga 2400 
ggttcagggc aaagtcatca tggagacctg aaaagcttga actctcggat ttagaagccc 2460 
gcaggcaaag gcgccaacgc agatgggaag atatctttaa tcagcatgag gaagaattga 2520 
gacaagttga taaaaccagt tggcgtcagt ggggtccttc agaccagttg attgacaata 2580 
gtttcactgg tatgcaagac ctgaagaaat atctcaaagg aaaaacaaca tttcataatg 2640 
ttcaagttgt tatctacaga gcagttaagg ggaatgatga tgttgcaaca aggtctaccg 2700 
ttcctcagcg gtctcttttg gaacaagctc agaagcccat tgacatcaga caaaggagtt 2760 
cgcaaaatcg tcaaaattgg ctggcagcat caggtgattc aaagcacaaa attttagcag 2820 
gcaaaacaca gagctactgt ttaacaattt atatttcaga agtcaagaaa gaagaatttc 2880 
aagaaggaat gaatcaaaag tgtcagggag cccaagtagg attaggacct gaaggccatt 2940 
gtatttggca attgggtgaa tcttcttctg aggaagaaag tcctgtgact ggaaggaggt 3000 
ctcagtcatc accaccttat tctactattg atcagaagtt gctggttgac atccatgttc 3060 
cagatggatt taaagtagga aaaatatcac cccctgtata cttgacaaac gaatgggtag 3120 
gctataatgc actctctgaa atcttccgga atgattggtt aactccggca cctgtcattc 3180 
agccacctga agaggatggt gattatgttg aactctatga tgccagtgct gatactgatg 3240 
gtgatgatga tgatgagtct aatgatactt ttgaagatac ctatgatcat gccaatggca 3300 
atgatgactt ggataaccag gttgatcagg ctaatgatgt ttgtaaagac catgatgatg 3360 
acaacaataa gtttgttgat gatgtaaata ataattatta tgaggcgcct agttgtccaa 3420 
gcttgttgtc agggcaagct atggcagaga tggaagctgc aagcaagatg gttatgatgg 3480 
aagtcgtgga aaagaggaag cctacagagg ctatggaagc catacagcca atagaagcca 3540 
tggaggaagt gcagccagtg agggacaatg cagccattgg agatcaggaa gaacatgcag 3600 
ccaatatagg cagtgaaaga agaggcagtg agggtgatgg aggtaaggga gtcgttcgaa 3660 
ccagtgaaga gagtggagcc cttggactca atggagaaga aaattgctca gagacagatg 3720 
gtccaggatt gaagagacct gcgtctcagg actttgaata tctacaggag gagccaggtg 3780 
gtggaaatga ggcctcaaat gccattgact caggtgctgc accgtcagca cctgatcatg 3840 
agagtgacaa taaggacata tcagaatcat caacacaatc agatttttct gccaatcact 3900 
catctccttc caaaggttct gggatgtctg ctgatgctaa ctttgccagt gccatcttat 3960 
acgctggatt cgtagaagta cctgaggaat cacctaagca accctctgaa gtcaatgtta 4020 
acccactcta tgtctctcct gcatgtaaaa aaccactaat ccacatgtat gaaaaggagt 4080 
tcacttctga gatctgctgt ggttctttgt ggggagtcaa tttgctgttg ggaacccgat 4140 
ctaatctata tctgatggac agaagtggaa aggctgacat tactaaactt ataaggcgaa 4200 
gaccattccg ccagattcaa gtcttagagc cactcaattt gctgattacc atctcaggtc 4260 
ataagaacag acttcgggtg tatcatctga cctggttgag gaacaagatt ttgaataatg 4320 
atccagaaag taaaagaagg caagaagaaa tgctgaagac agaggaagcc tgcaaagcta 4380 
ttgataagtt aacaggctgt gaacacttca gtgtcctcca acatgaagaa acaacatata 4440 
ttgcaattgc tttgaaatca tcaattcacc tttatgcatg ggcaccaaag tcctttgatg 4500 
aaagcactgc tattaaagta tttccaacac ttgatcataa gccagtgaca gttgacctgg 4560 
ctattggttc tgaaaaaaga ctaaagattt tcttcagctc agcagatgga tatcacctca 4620 
tcgatgcaga atctgaggtt atgtctgatg tgaccctgcc aaagaataat atcatcattt 4680 
tacctgattg cttgggaatt ggcatgatgc tcaccttcaa tgctgaagcc ctctctgtgg 4740 
aagcaaatga acaactcttc aagaagatcc ttgaaatgtg gaaagacata ccatcttcta 4800 
tagcttttga atgtacacag cgaaccacag gatggggcca aaaggccatt gaagtgcgct 4860 
ctttgcaatc cagggttctg gaaagtgagc tgaagcgcag gtcaattaag aagctgagat 4920 
tcctgtgcac ccggggtgac aagctgttct ttacctctac cctgcgcaat caccacagcc 4980 
gggtttactt catgacactt ggaaaacttg aagagctcca aagcaattat gatgtctaaa 5040 
agtttccagt gatttattac cacattataa acatcatgta taggcagtct gcatcttcag 5100 
atttcagaga ttaaatgagt attcagtttt atttttagta aagattaaat ccaaaacttt 5160 
acttttaatg tagcacagaa tagttttaat gagaaatg 5198 

<210> 39 
<211> 3969 
<212> DNA 
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<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 55009053CB1 

<400> 39 

cttttttcct ttcagtgtgc ttcaaatgtc acgacacagg ttagctcagt cgacttgggg 60 
ctgctgagct ctggtccctg ccagcctcac cgctcggacc cccccgatcc tccggactcc 120 
gctggtcctg gccacgcgag gagcccacgc tagctccaaa gaatcccccg agggcacgtg 180 
gaccgaggga gcccctgtga aggctgcgga agactccgcg cgtcccgagc tcccggactc 240 
tgcagtgggc ccggggtcca gggagccgct aagggtccct gaagctgtgg ccctagagcg 300 
gcggcgggag caggaagaaa aggaggacat ggagacccag gctgtggcaa cgtcccccga 360 
tggccgatac ctcaagtttg acatcgagat tggacgtggc tccttcaaga cggtgtatcg 420 
agggctagac accgacacca cagtggaggt ggcctggtgt gagctgcaga ctcggaaact 480 
gtctagagct gagcggcagc gcttctcaga ggaggtggag atgctcaagg ggctgcagca 540 
ccccaacatc gtccgcttct atgattcgtg gaagtcggtg ctgaggggcc aggtttgcat 600 
cgtgctggtc accgaactca tgacctcggg cacgctcaag acgtacctga ggcggttccg 660 
ggagatgaag ccgcgggtcc ttcagcgctg gagccgccaa atcctgcggg gacttcattt 720 
cctacactcc cgggttcctc ccatcctgca ccgggatctc aagtgcgaca atgtctttat 780 
cacgggacct tctggctctg tcaaaatcgg ggacctgggc ctggccacgc tcaagcgcgc 840 
ctcctttgcc aagagtgtca tcgggacccc ggaattcatg gcccccgaga tgtacgagga 900 
aaagtacgat gaggccgtgg acgtgtacgc gttcggcatg tgcatgctgg agatggccac 960 
ctctgagtac ccgtactccg agtgccagaa tgccgcgcaa atctaccgca aggtcacttc 1020 
gggcagaaag ccgaacagct tccacaaggt gaagataccc gaggtgaagg agatcattga 1080 
aggctgcatc cgcacggata agaacgagag gttcaccatc caggacctcc tggcccacgc 1140 
cttcttccgc gaggagcgcg gtgtgcacgt ggaactagcg gaggaggacg acggcgagaa 1200 
gccgggcctc aagctctggc tgcgcatgga ggacgcgcgg cgcggggggc gcccacggga 1260 
caaccaggcc atcgagttcc tgttccagct gggccgggac gcggccgagg aggtggcaca 1320 
ggagatggtg gctctgggct tggtctgtga agccgattac cagccagtgg cccgtgcagt 1380 
acgtgaacgg gttgctgcca tccagcgaaa gcgtgagaag ctgcgtaaag caagggaatt 1440 
ggaggcactc ccaccagagc caggacctcc accagcaact gtgcccatgg cccccggtcc 1500 
ccccagtgtc ttcccccctg agcctgagga gccagaggca gaccagcacc agcccttcct 1560 
tttccgccac gccagctact catctaccac ttcggattgc gagactgatg gctacctcag 1620 
ctcctccggc ttcctggatg cctcagaccc tgcccttcag ccccctgggg gggtgccatc 1680 
cagcctggct gagtcccatc tctgcctgcc ctcggctttt gccctatcca ttccacgttc 1740 
tggccctgga agtgactttt cccccgggga cagctatgcc tcagatgcag cttcaggcct 1800 
tagcgatgtg ggagaaggga tgggacaaat gaggagaccc ccagggagga atctccggcg 1860 
cagaccccga tcccggctgc gggtcactag tgtctcagac cagaatgaca gagtggttga 1920 
gtgccagcta cagacccata acagcaagat ggtgaccttc cgatttgatc tggatgggga 1980 
cagcccggaa gagattgcag ctgccatggt atataacgag ttcattctgc cttcggagcg 2040 
agatggattt ctcagacgga ttcgggagat tatccagcga gtggagaccc tgttgaagag 2100 
agacactggc cccatggagg ctgctgaaga caccctaagc ccccaggagg agccagcacc 2160 
attacctgcc ctgcccgtcc ccctcccaga cccatccaat gaagagctcc agagcagcac 2220 
ctccctggag cacaggagct ggacagcctt ctccacctcc tcatcttctc ctggaactcc 2280 
tttgtctcct ggaaacccat tttcccctgg aacccccatt tccccaggtc ccatcttccc 2340 
catcacttct cccccatgtc atcccagccc ctccccattc tcccccattt cttcccaggt 2400 
ctcctcaaat ccctctccac accccaccag ctctccactt ccattctcct ccagcacacc 2460 
cgagtttccg gtcccactct ctcagtgtcc ctggagttct ctccccacga cttctccacc 2520 
tacgttctct cccacttgtt ctcaggtcac tcttagttcc cctttctttc ctccgtgccc 2580 
ctccacttct tccttcccct ccaccacagc agcccctctc ctttctctgg ctagtgcctt 2640 
ctcactggct gtgatgactg tggcccagtc cctgctgtcc ccctcacctg ggctcctttc 2700 
ccagtctcct ccagcccctc ctagtcccct ccctagcctg ccccttcccc ctcccgttgc 2760 
tcctggtggc caggaaagcc cttcacccca cacagctgag gtggagagtg aggcctcacc 2820 
acctcctgct cggcccctcc caggggaagc caggctggcg cccatctctg aagagggaaa 2880 
gccgcagctt gttgggcgtt tccaagtgac ttcatccaag gaaccggctg agcctcttcc 2940 
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cttgcagcca acatccccca 
ttcagagagc tcagatacag 
ggctgagagc gaccgtgcag 
tgggaaggaa ccccaagttg 
gatgaactac tcctacagca 
agatgaggag ttctgggctg 
aacactacag acactacaga 
gcccccaccg ggtattgtgg 
caagggcagc ttccccacct: 
tggcatcatg cgaaggaact 
aagcaagggg gtgacattcg 
tgtatctccc ccacaccagg 
tgatcaacaa aactgagcaa 
atggagagtg cagctccatt 
gaggtgttag ttctgctgcc 
ccactaagca atcccaccca 
agtgtagggg atatgctcac 
aaaaaaaaa 

<210> 40 
<211> 1803 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 7474648CB1 

<400> 40 

atgggtgaaa gtggaaacca tcattttcag caaactaaca caggaacaga aaaccaaaca 60 
gcacatgttc tcactcataa gtgggagttg gacaatgaaa acatatgggc acagggaggg 120 
gaacatcaca aactgggacc tgtcatgggt tggaaggcta ggagtgggaa aacattagga 180 
gaaataccta acgtaggcac actcacactc ctcactggct atgggggatg ccagctgcca 240 
tgctgcaagg acactcaggc agcctatgga gaaacccacg tggtgcggag tggaggcctt 300 
ctgccaacag ccagctggga actgaggcct gctgacagtc acacggtgac cagcgatgat 360 
ccaggcgtct cggtcgttag cgggtatcct gggggctgtc tccctgacca cgacccccca 420 
gtggggtttc tttccgaggg tcccgcccct cgcagctgct ctttgataaa gggcggagga 480 
acggggctgg ctgcttcccg agtccccagg tcccgcgagc ggcgggcgtg ttgcgggtat 540 
ggggtgcggc gccagcagga aggtggtccc ggggccacca gcgctggctt gggccaagca 600 
cgaaggtcaa aaccaagccg gcgtcggagg cgcggggcct gggcccgagg cggcggccca 660 
ggcggcgcag aggatacagg tggctcgctt ccgagccaag ttcgaccccc gggtccttgc 720 
cagtgcccag tacaatttct ctttgacatc tctgaacagg gagttcagag gatgggaaaa 780 
aagagagcag gagcagcagc aaacaaggga aggaattcct atcttcggag atatgacatc 840 
aaagctctta ttgggacagg cagtttcagc agggttgtca gggtagagca gaagaccacc 900 
aagaaacctt ttgcaataaa agtgatggaa accagagaga gggaaggtag agaagcgtgc 960 
gtgtctgagc tgagcgtcct gcggcgggtt agccatcgtt acattgtcca gctcatggag 1020 
atctttgaga ctgaggatca agtttacatg gtaatggagc tggctaccgg aggggagctc 1080 
tttgatcgac tcattgctca gggatccttt acagagcggg atgccgtcag gatcctccag 1140 
atggttgctg atgggattag gtatttgcat gcgctgcaga taactcatag gaatctaaag 1200 
cctgaaaacc tcttatacta tcatccaggt gaagagtcga aaattttaat tacagatttt 1260 
ggtttggcat actccgggaa aaaaagtggt gactggacaa tgaagacact ctgtgggacc 1320 
ccagagtaca tagctcctga ggttttgcta aggaagcctt ataccagtgc agtggacatg 1380 
tgggctcttg gtgtgatcac atatgcttta cttagcggat tcctgccttt tgatgatgaa 1440 
agccagacaa ggctttacag gaagattctg aaaggcaaat ataattatac aggagagcct 1500 
tggccaagca tttcccactt ggcgaaggac tttatagaca aac tact gat tttggaggct 1560 
ggtcatcgca tgtcagctgg ccaggccctg gaccatccct gggtgatcac catggctgca 1620 



ctctctctgg ttctccaaaa 
aggacagtgc tggaggcggg 
ctgagggtct gggggctgga 
ggggcagccc ccaacccctg 
gcctgtgttt gagcagcgag 
agctgcagag tcttcggcag 
aaaaagaaat tgaagatttg 
ccccagctgc tatgctgtcc 
cccgccgcaa cagcctacag 
ctctgagtgg cagcagcacc 
ccggggatgt tggcaggatg 
gcccaccatg gagcttgtgt 
ggaagatccc aacactgaag 
atagtgaaga gccaaacata 
taccatcttc atctctagca 
agcctggatg cttctagagg 
accacattag cagcaaccaa 



ccttcaaccc ctcagctcac 3000 
ccagagacca gggaagctct 3060 
gttgaggagg aaggagatga 3120 
agccatccca gcccagtgtg 3180 
gagtcagaaa gcagtgggga 3240 
aagcacttgt cagaggtgga 3300 
tacagccggc tggggaagca 3360 
agccgccagc gccgcctctc 3420 
cgctctgagc ccccaggccc 3480 
ggctcccagg agcagcgggc 3540 
tgaattcaga acagaagcca 3600 
tctcagaatc tgatgctttc 3660 
gggtagaagg ccaggggggc 3720 
tgtgaactgt ttgctgtgtg 3780 
cctcccctgc caagagtcaa 3840 
ggcccactcc cagctgggag 3900 
taaaaatgct ggaaacaaga 3960 

3969 
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gggtcttcca tgaagaatct ccagagggcc atatcccgaa acctcatgca gagggcctct 1680 
ccccactctc agagtcctgg atctgcacag tcttctaagt cacattattc tcacaaatcc 1740 
aggcatatgt ggagcaagag aaacttaagg atagtagaat cgccactgtc tgcgcttttg 1800 
taa 1803 



<210> 41 
<211> 3472 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^f eature 

<223> Incyte ID No: 7483053CB1 



<400> 41 

atggcgaagg cgacgtccgg tgccgcgggg ctgcg.tctgc tgttgctgct gctgctgccg 60 
ctgctaggca aagtggcatt gggcctctac ttctcgaggg atgcttactg ggagaagctg 120 
tatgtggacc aggcagccgg cacgcccttg ctgtacgtcc atgccctgcg ggacgcccct 180 
gaggaggtgc ccagcttccg cctgggccag catctctacg gcacgtaccg aacacggctg 240 
catgagaaca actggatctg catccaggag gacaccggcc tcctctacct taaccggagc 300 
ctggaccata gctcctggga gaagctcagt gtccgcaacc gcggctttcc cctgctcacc 360 
gtctacctca aggtcttcct gtcacccaca tcccttcgtg agggcgagtg ccagtggcca 420 
ggctgtgccc gcgtatactt ctccttcttc aacacctcct ttccagcctg cagctccctc 480 
aagccccggg agctctgctt cccagagaca aggccctcct tccgcattcg ggagaaccga 540 
cccccaggca ccttccacca gttccgcctg ctgcctgtgc agttcttgtg ccccaacatc 600 
agcgtggcct acaggctcct ggagggtgag ggtctgccct tccgctgcgc cccggacagc 660 
ctggaggtga gcacgcgctg ggccctggac cgcgagcagc gggagaagta cgagctggtg 720 
gccgtgtgca ccgtgcacgc cggcgcgcgc gaggaggtgg tgatggtgcc cttcccggtg 780 
accgtgtacg acgaggacga ctcggcgccc accttccccg cgggcgtcga caccgccagc 840 
gccgtggtgg agttcaagcg gaaggaggac accgtggtgg ccacgctgcg tgtcttcgat 900 
gcagacgtgg tacctgcatc aggggagctg gtgaggcggt acacaagcac gctgctcccc 960 
ggggacacct gggcccagca gaccttccgg gtggaacact ggcccaacga gacctcggtc 1020 
caggccaacg gcagcttcgt gcgggcgacc gtacatgact ataggctggt tctcaaccgg 1080 
aacctctcca tctcggagaa ccgcaccatg cagctggcgg tgctggtcaa tgactcagac 1140 
ttccagggcc caggagcggg cgticctcttg ctccacttca acgtgtcggt gctgccggtc 1200 
agcctgcacc tgcccagtac ctactccctc tccgtgagca ggagggctcg ccgatttgcc 1260 
cagatcggga aagtctgtgt ggaaaactgc caggcgttca gtggcatcaa cgtccagtac 1320 
aagctgcatt cctctggtgc caactgcagc acgctagggg tggtcacctc agccgaggac 1380 
acctcgggga tcctgtttgt gaatgacacc aaggccctgc ggcggcccaa gtgtgccgaa 1440 
cttcactaca tggtggtggc caccgaccag cagacctcta ggcaggccca ggcccagctg 1500 
cttgtaacag tggaggggtc atatgtggcc gaggaggcgg gctgccccct gtcctgtgca 1560 
gtcagcaaga gacggctgga gtgtgaggag tgtggcggcc tgggctcccc aacaggcagg 1620 
tgtgagtgga ggcaaggaga tggcaaaggg atcaccagga acttctccac ctgctctccc 1680 
agcaccaaga cctgccccga cggccactgc gatgttgtgg agacccaaga catcaacatt 1740 
tgccctcagg actgcctccg gggcagcatt gttgggggac acgagcctgg ggagccccgg 1800 
gggattaaag ctggctatgg cacctgcaac tgcttccctg aggaggagaa gtgcttctgc 1860 
gagcccgaag acatccagga tccactgtgc gacgagctgt gccgcacggt gatcgcagcc 1920 
gctgtcctct tctccttcat cgtctcggtg ctgctgtctg ccttctgcat ccactgctac 1980 
cacaagtttg cccacaagcc acccatctcc tcagctgaga tgaccttccg gaggcccgcc 2040 
caggccttcc cggtcagcta ctcctcttcc agtgcccgcc ggccctcgct ggactccatg 2100 
gagaaccagg tctccgtgga tgccttcaag atcctggagg atccaaagtg ggaattccct 2160 
cggaagaact tggttcttgg aaaaactcta ggagaaggcg aatttggaaa agtggtcaag 2220 
gcaacggcct tccatctgaa aggcagagca gggtacacca cggtggccgt gaagatgctg 2280 
aaagagaacg cctccccgag tgagcttcga gacctgctgt cagagttcaa cgtcctgaag 2340 
caggtcaacc acccacatgt catcaaattg tatggggcct gcagccagga tggcccgctc 2400 
ctcctcatcg tggagtacgc caaatacggc tccctgcggg gcttcctccg cgagagccgc 2460 
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aaagtggggc ctggctacct gggcagtgga ggcagccgca actccagctc cctggaccac 2520 
ccggatgagc gggccctcac catgggcgac ctcatctcat ttgcctggca gatctcacag 2580 
gggatgcagt atctggccga gatgaagctc gttcatcggg acttggcagc cagaaacatc 2640 
ctggtagctg aggggcggaa gatgaagatt tcggatttcg gcttgtcccg agatgtttat 2700 
gaagaggatt cgtacgtgaa gaggagccag ggtcggattc cagttaaatg gatggcaatt 2760 
gaatcccttt ttgatcatat ctacaccacg caaagtgatg tatggtcttt tggtgtcctg 2820 
ctgtgggaga tcgtgaccct agggggaaac ccctatcctg ggattcctcc tgagcggctc 2880 
ttcaaccttc tgaagaccgg ccaccggatg gagaggccag acaactgcag cgaggagatg 2940 
taccgcctga tgctgcaatg ctggaagcag gagccggaca aaaggccggt gtttgcggac 3000 
atcagcaaag acctggagaa gatgatggtt aagaggagag actacttgga ccttgcggcg 3060 
tccactccat ctgactccct gatttatgac gacggcctct cagaggagga gacaccgctg 3120 
gtggactgta ataatgcccc cctccctcga gccctccctt ccacatggat tgaaaacaaa 3180 
ctctatggca tgtcagaccc gaactggcct ggagagagtc ctgtaccact cacgagagct 3240 
gatggcacta acactgggtt tccaagatat ccaaatgata gtgtatatgc taactggatg 3300 
ctttcaccct cagcggcaaa attaatggac acgtttgata gttaacattt ctttgtgaaa 3360 
ggtaatggac tcacaagggg aagaaacatg ctgagaatgg aaagtctacc ggccctttct 3420 
ttgtgaacgt cacattggcc gagccgtgtt cagttcccag gtggcagact eg 3472 

<210> 42 

<211> 1704 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7483117CB1 

<400> 42 

atggatgaca aagatattga caaagaacta aggcagaaat taaacttttc ctattgtgag 60 
gagactgaga ttgaagggca gaagaaagta gaagaaagca gggaggcttic gagccaaacc 120 
ccagagaagg gtgaagtgca ggattcagag gcaaagggta caccaccttg gactcccctt 180 
agcaacgtgc aCgagctcga cacatcttcg gaaaaagaca aagaaagtcc agatcagatt 240 
ttgaggactc cagtgtcaca ccctctcaaa tgtcctgaga caccagccca accagacagc 300 
aggagcaagc tgctgcccag tgacagcccc tctactccca aaaccatgct gagccggttg 360 
gtgatttctc caacagggaa gcttccttcc agaggcccta agcatttgaa gctcacacct 420 
gctcccctca aggatgagat gacctcattg gctctggtca atattaatcc cttcactcca 480 
gagtcctata aaaaattatt tcttcaatct ggtggcaaga ggaaaataag aggagatctt 540 
gaggaagctg gtccagagga aggcaaggga gggctgcctg ccaagagatg tgttttacga 600 
gaaaccaaca tggcttcccg ctatgaaaaa gaattcttgg aggttgaaaa aattggggtt 660 
ggcgaatttg gtacagtcta caagtgcatt aagaggctgg atggatgtgt ttatgcaata 720 
aagcgctcta tgaaaacttt tacagaatta tcaaatgaga attcggcttt gcatgaagtt 780 
tatgctcacg cagtgcttgg gcatcacccc catgtggtac gttactattc ctcatgggca 840 
gaagatgacc acatgatcat tcagaatgaa tactgcaatg gtgggagttt gcaagctgct 900 
atatctgaaa acactaagtc tggcaatcat tttgaagagc caaaactcaa ggacatcctt 960 
ctacagattt cccttggcct taattacatc cacaactcta gcatggtaca cctggacatc 1020 
aaacctagta atatattcat ttgtcacaag atgcaaagtg aatcctctgg agtcatagaa 1080 
gaagttgaaa atgaagctga ttggtttctc tctgccaatg tgatgtataa aattggtgac 1140 
ctgggccacg caacatcaat aaacaaaccc aaagtggaag aaggagatag tcgcttcctg 1200 
gctaatgaga ttttgcaaga ggattaccgg caccttccca aagcagacat atttgccttg 1260 
ggattaacaa ttgcagtggc tgcaggagca gagtcattgc ccaccaatgg tgctgcatgg 1320 
caccatatcc gcaagggtaa ctttccggac gttcctcagg agctctcaga aagcttttcc 1380 
agtictgctca agaacatgat ccaacctgat gccgaacaga gaccttctgc agcagctctg 1440 
gccagaaata cagttctccg gccttccctg ggaaaaacag aagagctcca acagcagctg 1500 
aatttggaaa agttcaagac tgccacactg gaaagggaac tgagagaagc ccagcaggcc 1560 
cagtcacccc agggatatac ccatcatggt gacactgggg tctctgggac ccacacagga 1620 
tcaagaagca caaaacgcct ggtgggagga aagagtgcaa ggtcttcaag ctttacctca 1680 



71/75 



wo 02/33099 



PCTAJSOl/47728 



ggagagcgtg agcctctgca ttaa 

<210> 43 

<211> 6298 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 748449 8CB1 



1704 



<400> 43 

cgcggggcgg aacagatcgc agacctgggg gttcgcagag ccgccagtgg ggagatgttg 60 
aagttcaaat atggagcgcg gaatcctttg gatgctggtg ctgctgaacc cattgccagc 120 
cgggcctcca ggctgaatct gttcttccag gggaaaccac cctttatgac tcaacagcag 180 
atgtctcctc tttcccgaga agggatatta gatgccctct ttgttctctt tgaagaatgc 240 
agtcagcctg ctctgatgaa gattaagcac gtgagcaact ttgtccggaa gtattccgac 300 
accatagctg agttacagga gctccagcct tcggcaaagg acttcgaagt cagaagtctt 360 
gtaggttgtg gtcactttgc tgaagtgcag gtggtaagag agaaagcaac cggggacatc 420 
tatgctatga aagtgatgaa gaagaaggct ttattggccc aggagcaggt ttcatttttt 480 
gaggaagagc ggaacatatt atctcgaagc acaagcccgt ggatccccca attacagtat 540 
gcctttcagg acaaaaatca cctttatctg gtcatggaat atcagcctgg aggggacttg 600 
ctgtcacttt tgaatagata tigaggaccag ttagatgaaa acctgataca gttttaccta 660 
gctgagctga ttttggctgt tcacagcgtt: catctgatgg gatacgtgca tcgagacatc 720 
aagcctgaga acattctcgt tgaccgcaca ggacacatca agctggtgga ttttggatct 780 
gccgcgaaaa tgaattcaaa caagatggtg aatgccaaac tcccgattgg gaccccagat 840 
tacatggctc ctgaagtgct gactgtgatg aacggggatg gaaaaggcac ctacggcctg 900 
gactgtgact ggtggtcagt gggcgtgatt gcctatgaga tgatttatgg gagatccccc 960 
ttcgcagagg gaacctctgc cagaaccttc aataacatta tgaatttcca gcggtttttg 1020 
aaatttccag atgaccccaa agtgagcagt gactttcttg atctgattca aagcttgttg 1080 
t^9<=^99ccaga aagagagact gaagtttgaa ggtctttgct gccatccttt cttctctaaa 1140 
attgactgga acaacattcg taactctcct ccccccttcg ttcccaccct caagtctgac 1200 
gatgacacct ccaattttga tgaaccagag aagaattcgt gggtttcatc ctctccgtgc 1260 
cagctgagcc cctcaggctt ctcgggtgaa gaactgccgt ttgtggggtt ttcg^acagc 1320 
aaggcactgg ggattcttgg tagatctgag tctgttgtgt cgggtctgga ctcccctgcc 1380 
aagactagct ccatggaaaa gaaacttctc atcaaaagca aagagctaca agactctcag 1440 
gacaagtgtc acaagatgga gcaggaaatg acccggttac atcggagagt gtcagaggtg 1500 
gaggctgtgc ttagtcagaa ggaggtggag ctgaaggcct ctgagactca gagatccctc 1560 
ctggagcagg accttgctac ctacatcaca gaatgcagta gcttaaagcg aagtttggag 1620 
caagcacgga tggaggtgtc ccaggaggat gacaaagcac tgcagcttct ccatgatatc 1680 
agagagcaga gccggaagct ccaagaaatc aaagagcagg agtaccaggc tcaagtggaa 1740 
gaaatgaggt tgatgatgaa tcagttggaa gaggatcttg tctcagcaag aagacggagt 1800 
gatctctacg aatctgagct gagagagtct cggcttgctg ctgaagaatt caagcggaaa 1860 
gcgacagaat gtcagcataa actgttgaag gctaaggatc aagggaagcc tgaagtggga 1920 
gaatatgcga aactggagaa gatcaatgct gagcagcagc tcaaaattca ggagctccaa 1980 
gagaaactgg agaaggctgt aaaagccagc acggaggcca ccgagctgct gcagaatatc 2040 
cgccaggcaa aggagcgagc cgagagggag ctggagaagc tgcagaaccg agaggattct 2100 
tctgaaggca tcagaaagaa gctggtggaa gctgaggaac gccgccattc tctggagaac 2160 
aaggtaaaga gactagagac catggagcgt agagaaaaca gactgaagga tgacatccag 2220 
acaaaatccc aacagatcca gcagatggct gataaaattc tggagctcga agagaaacat 2280 
cgggaggccc aagtctcagc ccagcaccta gaagtgcacc tgaaacagaa agagcagcac 2340 
tatgaggaaa agattaaagt gttggacaat cagataaaga aagacctggc tgacaaggag 2400 
acactggaga acatgatgca gagacacgag gaggaggccc atgagaaggg caaaattctc 2460 
agcgaacaga aggcgatgat caatgctatg gattccaaga tcagatccct ggaacagagg 2520 
attgtggaac tgtctgaagc caataaactt gcagcaaata gcagtctttt tacccaaagg 2580 
aacatgaagg cccaagaaga gatgatttct gaactcaggc aacagaaatt ttacctggag 2640 
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acacaggctg ggaagttgga ggcccagaac cgaaaactgg aggagcagct ggagaagatc 2700 
agccaccaag accacagtga caagaatcgg ctgctggaac tggagacaag attgcgggag 2760 
gtcagtctag agcacgagga gcagaaactg gagctcaagc gccagctcac agagctacag 2820 
ctctccctgc aggagcgcga gtcacagttg acagccctgc aggctgcacg ggcggccctg 2880 
gagagccagc ttcgccaggc gaagacagag ctggaagaga ccacagcaga agctgaagag 2940 
gagatccagg cactcacggc acatagagat gaaatccagc gcaaatttga tgctcttcgt 3000 
aacagctgta ctgtaatcac agacctggag gagcagctaa accagctgac cgaggacaac 3060 
gctgaactca acaaccaaaa cttctacttg tccaaacaac tcgatgaggc ttctggcgcc 3120 
aacgacgaga ttgtacaact gcgaagtgaa gtggaccatc tccgccggga gatcacggaa 3180 
cgagagatgc agcttaccag ccagaagcaa acgatggagg ctctgaagac cacgtgcacc 3240 
atgctggagg aacaggtcat ggatttggag gccctaaacg atgagctgct agaaaaagag 3300 
cggcagtggg aggcctggag gagcgtcctg ggtgatgaga aatcccagtt tgagtgtcgg 3360 
gttcgagagc tgcagaggat gctggacacc gagaaacaga gcagggcgag agccgatcag 3420 
cggatcaccg agtctcgcca ggtggtggag ctggcagtga aggagcacaa ggctgagatt 3480 
ctcgctctgc agcaggctct caaagagcag aagctgaagg ccgagagcct ctctgacaag 3540 
ctcaatgacc tggagaagaa gcatgctatg cttgaaatga atgcccgaag cttacagcag 3600 
aagctggaga ctgaacgaga gctcaaacag aggcttctgg aagagcaagc caaatCacag 3660 
cagcagatgg acctgcagaa aaatcacatt ttccgtctga ctcaaggact gcaagaagct 3720 
ctagatcggg ctgatctact gaagacagaa agaagtgact tggagtatca gctggaaaac 3780 
attcaggttc tctattctca tgaaaaggtg aaaatggaag gcactatttc tcaacaaacc 3840 
aaactcattg attttctgca agccaaaatg gaccaacctg ctaaaaagaa aaaggttcct 3900 
ctgcagtaca atgagctgaa gctggccctg gagaaggaga aagctcgctg tgcagagcta 3960 
gaggaagccc ttcagaagac ccgcatcgag ctccggtccg cccgggagga agctgcccac 4020 
cgcaaagcaa cggaccaccc acacccatcc acgccagcca ccgcgaggca gcagatcgcc 4080 
atgtccgcca tcgtgcggtc gccagagcac cagcccagtg ccatgagcct gctggccccg 4140 
ccatccagcc gcagaaagga gtcttcaact ccagaggaat ttagtcggcg tcttaaggaa 4200 
cgcatgcacc acaatattcc tcaccgattc aacgtaggac tgaacatgcg agccacaaag 4260 
tgtgctgtgt gtctggatac cgtgcacttt ggacgccagg catccaaatg tctcgaatgt 4320 
caggtgatgt gtcaccccaa gtgctccacg tgcttgccag ccacctgcgg cttgcctgct 4380 
gaatatgcca cacacttcac cgaggccttc tgccgtgaca aaatgaactc cccaggtctc 4440 
cagaccaagg agcccagcag cagcttgcac ctggaagggt ggatgaaggt gcccaggaat 4500 
aacaaacgag gacagcaagg ctgggacagg aagtacattg tcctggaggg atcaaaagtc 4560 
ctcatttatg acaatgaagc cagagaagct ggacagaggc cggtggaaga atttgagctg 4620 
tgccttcccg acggggatgt atctattcat ggtgccgttg gtgcttccga actcgcaaat 4680 
acagccaaag cagatgtccc atacatactg aagatggaat ctcacccgca caccacctgc 4740 
tggcccggga gaaccctcta cttgctagct cccagcttcc ctgacaaaca gcgctgggtc 4800 
accgccttag aatcagttgt cgcaggtggg agagtttcta gggaaaaagc agaagctgat 4860 
gctaaactgc ttggaaactc cctgctgaaa ctggaaggtg atgaccgtct agacatgaac 4920 
tgcacgctgc ccttcagtga ccaggtggtg ttggtgggca ccgaggaagg gctctacgcc 4980 
ctgaatgtct tgaaaaactc cctaacccat gtcccaggaa ttggagcagt cttccaaatt 5040 
tatattatca aggacctgga gaagctactc atgatagcag gagaagagcg ggcactgtgt 5100 
cttgtggacg tgaagaaagt gaaacagtcc ctggcccagt cccacctgcc tgcccagccc 5160 
gacatctcac ccaacatttt tgaagctgtc aagggctgcc acttgtttgg ggcaggcaag 5220 
attgagaacg ggctctgcat ctgtgcagcc atgcccagca aagtcgtcat tctccgctac 5280 
aacgaaaacc tcagcaaata ctgcatccgg aaagagatag agacctcaga gccctgcagc 5340 
tgtatccact tcaccaatta cagtatcctc attggaacca ataaattcta cgaaatcgac 5400 
atgaagcagt acacgctcga ggaattcctg gataagaatg accattcctt ggcacctgct 5460 
gtgtttgccg cctcttccaa cagcttccct gtctcaatcg tgcaggtgaa cagcgcaggg 5520 
cagcgagagg agtacttgct gtgtttccac gaatttggag tgttcgtgga ttcttacgga 5580 
agacgtagcc gcacagacga tctcaagtgg agtcgcttac ctttggcctt tgcctacaga 5640 
gaaccctatc tgtttgtgac ccacttcaac tcactcgaag taattgagat ccaggcacgc 5700 
tcctcagcag ggacccctgc ccgagcgtac ctggacatcc cgaacccgcg ctacctgggc 5760 
cctgccattt cctcaggagc gatttacttg gcgtcctcat accaggataa attaagggtc 5820 
atttgctgca agggaaacct cgtgaaggag tccggcactg aacaccaccg gggcccgtcc 5880 
acctcccgca gcagccccaa caagcgaggc ccacccacgt acaacgagca catcaccaag 5940 
cgcgtggcct ccagcccagc gccgcccgaa ggccccagcc acccgcgaga gccaagcaca 6000 
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ccccaccgct accgcgaggg gcggaccgag ctgcgcaggg acaagtctcc tggccgcccc 6060 
ctggagcgag agaagtcccc cggccggatg ctcagcacgc ggagagagcg gtcccccggg 6120 
aggctgtttg aagacagcag caggggccgg ctgcctgcgg gagccgtgag gaccccgctg 6180 
tcccaggtga acaaggtctg ggaccagtct tcagtataaa tctcagccag aaaaaccaac 6240 
tcctcatctt gatctgcagg aaaacaccaa acacactatg gaactctgct gatgggga 6298 



<210> 44 
<211> 5454 
<212> DNA 

<213> Hozno sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7638121CB1 



<400> 44 

cacgcacacc gcacgtacgg ggttgggccc agctgggtta taagcgtgat ccccatgccc 60 
cctgcccagg ctggggggca tttgcacatc tgcaaaggcc tcccagcctg tcccagccct 120 
gccccagcct gggaccccca cattctactc accgtgtctc ctcagagggg ccagaaccct 180 
ccactgggga gaggcaagtg gcggtgaact tggtgtccat aggaccctgt ccctgagagc 240 
gacagctgag ttagtgagct ccactggccc caccaactcc ttctgatcac ctggccagct 300 
gaggtcagag tgggagaggc agtggttcca ttgaaggagt actcctaact gtcagaagcc 360 
tgggcggtca ggatggggtg ctgtcgcttg ggctgcgggg ggtgttcagt tgcccacagt 420 
gtatctcagg gtctcaccaa ccatccaagc atggtaggct gtggctggca cccagggttg 480 
tgtggctggg gaggtggtct ccacagttcc ctccctgccc tcccagggcc cccatccatg 540 
caggtaacca tcgaggatgt gcaggcacag acaggcggaa cggcccaatt cgaggctatc 600 
attgagggcg acccacagcc ctcggtgacc tggtacaagg acagcgtcca gctggtggac 660 
agcacccggc ttagccagca gcaagaaggc accacatact ccctggtgct gaggcatatg 720 
gcctcgaagg atgccggcgt ttacacctgc ctggcccaaa acactggtgg ccaggtgctc 780 
tgcaaggcag agctgctggt gcttgggggg gacaatgagc cggactcaga gaagcaaagc 840 
caccggagga agctgcactc cttctatgag gtcaaggagg agattggaag gggcgtgttt 900 
ggcttcgtaa aaagagtgca gcacaaagga aacaagatct tgtgcgctgc caagttcatc 960 
cccctacgga gcagaactcg ggcccaggca tacagggagc gagacatcct ggccgcgctg 1020 
agccacccgc tggtcacggg gctgctggac cagtttgaga cccgcaagac cctcatcctc 1080 
atcctggagc tgtgctcatc cgaggagctg ctggaccgcc tgtacaggaa gggcgtggtg 1140 
acggaggccg aggtcaaggt ctacatccag cagctggtgg aggggctgca ctacctgcac 1200 
agccatggcg ttctccacct ggacataaag ccctctaaca tcctgatggt gcatcctgcc 1260 
cgggaagaca ttaaaatctg cgactttggc tttgcccaga acatcacccc agcagagctg 1320 
cagttcagcc agtacggctc ccctgagttc gtctcccccg agatcatcca gcagaaccct 1380 
gtgagcgaag cctccgacat ttgggccatg ggtgtcatct cctacctcag cctgacctgc 1440 
tcatccccat ttgccggcga gagtgaccgt gccaccctcc tgaacgtcct ggaggggcgc 1500 
gtgtcatgga gcagccccat ggctgcccac ctcagcgaag acgccaaaga cttcatcaag 1560 
gctacgctgc agagagcccc tcaggcccgg cctagtgcgg cccagtgcct ctcccacccc 1620 
tggttcctga aatccatgcc tgcggaggag gcccacttca tcaacaccaa gcagctcaag 1680 
ttcctcctgg cccgaagtcg ctggcagcgt tccctgatga gctacaagtc catcctggtg 1740 
atgcgctcca tccctgagct gctgcggggc ccacccgaca gcccctccct cggcgtagcc 1800 
cggcacctct gcagggacac tggtggctcc tccagttcct cctcctcctc tgacaacgag 1860 
ctcgccccat ttgcccgggc taagtcactg ccaccctccc cggtgacaca ctcaccactg 1920 
ctgcaccccc ggggcttcct gcggccctcg gccagcctgc ctgaggaagc cgaggccagt 1980 
gagcgctcca ccgaggcccc agctccgcct gcatctcccg agggtgccgg gccaccggcc 2040 
gcccagggct gcgtgccccg gcacagcgtc atccgcagcc tgttctacca ccaggcgggt 2100 
gagagccctg agcacggggc cctggccccg gggagcaggc ggcacccggc ccggcggcgg 2160 
cacctgctga agggcgggta cattgcgggg gcgctgccag gcctgcgcga gccactgatg 2220 
gagcaccgcg tgctggagga ggaggccgcc agggaggagc aggccaccct cctggccaaa 2280 
gccccctcat tcgagactgc cctccggctg cctgcctctg gcacccactt ggcccctggc 2340 
cacagccact ccctggaaca tgactctccg agcacccccc gcccctcctc ggaggcctgc 2400 
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ggtgaggcac agcgactgcc ttcagccccc tccggggggg cccctatcag ggacatgggg 2460 
caccctcagg gctccaagca gcttccatcc actggtggcc acccaggcac tgctcagcca 2520 
gagaggccat ccccggacag cccttggggg cagccagccc ctttctgcca ccccaagcag 2580 
ggttctgccc cccaggaggg ctgcagcccc cacccagcag ttgccccatg ccctcctggc 2640 
tccttccctc caggatcttg caaagaggcc cccttagtac cctcaagccc cttcttggga 2700 
cagccccagg caccccttgc ccctgccaaa gcaagccccc cattggactc taagatgggg 2760 
cctggagaca tctctcttcc tgggaggcca aaacccggcc cctgcagttc cccagggtca 2820 
gcctcccagg cgagctcttc ccaagtgagc tccctcaggg tgggctcctc ccaggtgggc 2880 
acagagcctg gcccctccct ggatgcggag ggctggaccc aggaggctga ggatctgtcc 2940 
gactccacac ccaccttgca gcggcctcag gaacaggtga ccatgcgcaa gttctccctg 3000 
ggtggtcgcg ggggctacgc aggcgtggct ggctatggca cctttgcctt tggtggagat 3060 
gcagggggca tgctggggca ggggcccatg tgggccagga tagcctgggc tgtgtcccag 3120 
tcggaggagg aggagcagga ggaggccagg gctgagtccc agtcggagga gcagcaggag 3180 
gccagggctg agagcccact gccccaggtc agtgcaaggc ctgtgcctga ggtcggcagg 3240 
gctcccacca ggagctctcc agagcccacc ccatgggagg acatcgggca ggtctccctg 3300 
gtgcagatcc gggacctgtc aggtgatgcg gaggcggccg acacaatatc cctggacatt 3360 
tccgaggtgg accccgccta cctcaacctc tcagacctgt acgatatcaa gtacctccca 3420 
ttcgagttta tgatcttcag gaaagtcccc aagtccgctc agccagagcc gccctccccc 3480 
atggctgagg aggagctggc cgagttcccg gagcccacgt ggccctggcc aggtgaactg 3540 
ggcccccacg caggcctgga gatcacagag gagtcagagg atgtggacgc gctgctggca 3600 
gaggctgccg tgggcaggaa gcgcaagtgg tcctcgccgt cacgcagcct cttccacttc 3660 
cctgggaggc acctgccgct ggacgagcct gcagagctgg ggctgcgtga gagagtgaag 3720 
gcctccgtgg agcacatctc ccggatcctg aagggcaggc cggaaggtct ggagaaggag 3780 
gggcccccca ggaagaagcc aggccttgct tccttccggc tctcaggtct gaagagctgg 3840 
gaccgagcgc cgacattcct aagggagctc tcagatgaga ctgtggtcct gggccagtca 3900 
gtgacactgg cctgccaggt gtcagcccag ccagctgccc aggccacctg gagcaaagac 3960 
ggagcccccc tggagagcag cagccgtgtc ctcatctctg ccaccctcaa gaacttccag 4020 
cttctgacca tcctggtggt ggtggctgag gacctgggtg tgtacacctg cagcgtgagc 4080 
aatgcgctgg ggacagtgac caccacgggc gtcctccgga aggcagagcg cccctcatct 4140 
tcgccatgcc cggatatcgg ggaggtgtac gcggatgggg tgctgctggt ctggaagccc 4200 
gtggaatcct acggccctgt gacctacatt gtgcagtgca gcctagaagg cggcagctgg 4260 
accacactgg cctccgacat ctttgactgc tgctacctga ccagcaagct ctcccggggt 4320 
ggcacctaca ccttccgcac ggcatgtgtc agcaaggcag gaatgggtcc ctacagcagc 4380 
ccctcggagc aagtcctcct gggagggccc agccacctgg cctctgagga ggagagccag 4440 
gggcggtcag cccaacccct gcccagcaca aagaccttcg cattccagac acagatccag 4500 
aggggccgct tcagcgtggt gcggcaatgc tgggagaagg ccagcgggcg ggcgctggcc 4560 
gccaagatca tcccctacca ccccaaggac aagacagcag tgctgcgcga atacgaggcc 4620 
ctcaagggcc tgcgccaccc gcacctggcc cagctgcacg cagcctacct cagcccccgg 4680 
cacctggtgc tcatcttgga gctgtgctct gggcccgagc tgctcccctg cctggccgag 4740 
agggcctcct actcagaatc cgaggtgaag gactacctgt ggcagatgtt gagtgccacc 4800 
cagtacctgc acaaccagca catcctgcac ctggacctga ggtccgagaa catgatcatc 4860 
accgaataca acctgctcaa ggtcgtggac ctgggcaatg cacagagcct cagccaggag 4920 
aaggtgctgc cctcagacaa gttcaaggac tacctagaga ccatggctcc agagctcctg 4980 
gagggccagg gggctgttcc acagacagac atctgggcca tcggtgtgac agccttcatc 5040 
atgctgagcg ccgagtaccc ggtgagcagc gagggtgcac gcgacctgca gagaggactg 5100 
cgcaaggggc tggtccggct gagccgctgc tacgcggggc tgtccggggg cgccgtggcc 5160 
ttcctgcgca gcactctgtg cgcccagccc tggggccggc cctgcgcgtc cagctgcctg 5220 
cagtgcccgt ggctaacaga ggagggcccg gcctgttcgc ggcccgcgcc cgtgaccttc 5280 
cctaccgcgc ggctgcgcgt cttcgtgcgc aatcgcgaga agagacgcgc gctgctgtac 5340 
aagaggcaca acctggccca ggtgcgctga gggtcgcccc ggccacaccc ttggtctccc 5400 
cgctgggggt cgctgcagac gcgccaataa aaacgcacag ccgggcgaga aaaa 5454 
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